NUCLEIC ACID SEQUENCES TO PROTEINS INVOLVED IN TOCOPHEROL SYNTHESIS



INTRODUCTION 



This application claims the benefit of the filing date of provisional Application
U.S. Serial Number 60/129,899, filed April 15, 1999, and provisional Application
U.S. Serial Number 60/146,461, filed July 30, 1999.



TECHNICAL FIELD 

The present invention is directed to nucleic acid and amino acid sequences and 
constructs, and methods related thereto. 


BACKGROUND 

Isoprenoids are ubiquitous compounds found in all living organisms. Plants synthesize
a diverse array of more than 22,000 isoprenoids (Connolly and Hill (1992) Dictionary of
Terpenoids, Chapman and Hall, New York, NY). In plants, isoprenoids play essential roles in
particular cell functions, such as the production of sterols, which contribute to eukaryotic
membrane architecture; acyclic polyprenoids found in the side chains of ubiquinone and
plastoquinone; growth regulators such as abscisic acid, gibberellins, and brassinosteroids;
and the photosynthetic pigments chlorophylls and carotenoids. Although the physiological
role of other plant isoprenoids, such as the vast array of secondary metabolites, is less
evident, some are known to play key roles in mediating adaptive responses to different
environmental challenges. In spite of the remarkable diversity of structure and function, all
isoprenoids originate from a single metabolic precursor, isopentenyl diphosphate (IPP)
(Wright, (1961) Annu. Rev. Biochem. 20:525-548; and Spurgeon and Porter, (1981) in
Biosynthesis of Isoprenoid Compounds, Porter and Spurgeon eds (John Wiley, New York)
Vol. 1, pp 1-46).

A number of unique and interconnected biochemical pathways derived from the
isoprenoid pathway and leading to secondary metabolites, including tocopherols, exist in
chloroplasts of higher plants. Tocopherols not only perform vital functions in plants, but are
also important from mammalian nutritional perspectives. In plastids, tocopherols account for
up to 40% of the total quinone pool.

Tocopherols and tocotrienols (unsaturated tocopherol derivatives) are well known
antioxidants. They play an important role in protecting cells from free radical damage and in
the prevention of many diseases, including cardiac disease, cancer, cataracts, retinopathy,
Alzheimer's disease, and neurodegeneration, and have been shown to have beneficial effects
on symptoms of arthritis and in anti-aging. Vitamin E is used in chicken feed to improve
the shelf life, appearance, flavor, and oxidative stability of meat, and to transfer tocols from
feed to eggs. Vitamin E has been shown to be essential for normal reproduction, to improve
overall performance, and to enhance immunocompetence in livestock animals. Vitamin E
supplementation of animal feed also imparts oxidative stability to milk products.

The demand for natural tocopherols as supplements has been steadily growing at a
rate of 10-20% for the past three years. At present, the demand exceeds the supply for natural
tocopherols, which are known to be more biopotent than racemic mixtures of synthetically
produced tocopherols. Naturally occurring tocopherols are all d-stereomers, whereas
synthetic α-tocopherol is a mixture of eight d,l-α-tocopherol isomers, only one of which
(12.5%) is identical to the natural d-α-tocopherol. Natural d-α-tocopherol has the highest
vitamin E activity (1.49 IU/mg) when compared to other natural tocopherols or tocotrienols.
The synthetic α-tocopherol has a vitamin E activity of 1.1 IU/mg. In 1995, the worldwide
market for raw refined tocopherols was $1020 million; synthetic materials comprised 85-88%
of the market, the remaining 12-15% being natural materials. The best sources of natural
tocopherols and tocotrienols are vegetable oils and grain products. Currently, most of the
natural Vitamin E is produced from γ-tocopherol derived from soy oil processing, which is
subsequently converted to α-tocopherol by chemical modification (α-tocopherol exhibits the
greatest biological activity).
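
The isomer and market figures quoted in this paragraph can be checked with a short calculation. The sketch below is illustrative only: it assumes that α-tocopherol has three chiral centers (standard stereochemistry, not stated in the text), and the function names are hypothetical.

```python
# Each chiral center can take one of two configurations (R or S).
CHIRAL_CENTERS = 3  # assumption: alpha-tocopherol has three chiral centers


def stereoisomer_count(chiral_centers: int) -> int:
    """Number of possible stereoisomers for n independent chiral centers."""
    return 2 ** chiral_centers


def natural_fraction_percent(chiral_centers: int) -> float:
    """Percent of an equimolar isomer mixture that matches the one natural isomer."""
    return 100.0 / stereoisomer_count(chiral_centers)


def market_share_range(total: float, low_pct: float, high_pct: float):
    """Convert a percentage range of a market into absolute values (same units as total)."""
    return (total * low_pct / 100.0, total * high_pct / 100.0)


if __name__ == "__main__":
    print(stereoisomer_count(CHIRAL_CENTERS))        # eight isomers
    print(natural_fraction_percent(CHIRAL_CENTERS))  # share identical to the natural form
    print(market_share_range(1020, 85, 88))          # synthetic share, in $ million
```

Under these assumptions the eight isomers and the 12.5% figure follow directly from the number of chiral centers, and the synthetic share of the 1995 market corresponds to roughly $867 million to $898 million.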

Methods of enhancing the levels of tocopherols and tocotrienols in plants, especially
levels of the more desirable compounds that can be used directly, without chemical
modification, would be useful to the art as such molecules exhibit better functionality and
bioavailability.

In addition, methods for the increased production of other isoprenoid derived compounds
in a host plant cell are desirable. Furthermore, methods for the production of particular
isoprenoid compounds in a host plant cell are also needed.

SUMMARY OF THE INVENTION

The present invention is directed to prenyltransferase (PT), and in particular to PT
polynucleotides and polypeptides. The polynucleotides and polypeptides of the present
invention include those derived from prokaryotic and eukaryotic sources.

Thus, one aspect of the present invention relates to isolated polynucleotide sequences
encoding prenyltransferase proteins. In particular, isolated nucleic acid sequences encoding
PT proteins from bacterial and plant sources are provided.

Another aspect of the present invention relates to oligonucleotides which include
partial or complete PT encoding sequences.

It is also an aspect of the present invention to provide recombinant DNA constructs
which can be used for transcription or transcription and translation (expression) of
prenyltransferase. In particular, constructs are provided which are capable of transcription or
transcription and translation in host cells.

In another aspect of the present invention, methods are provided for production of
prenyltransferase in a host cell or progeny thereof. In particular, host cells are transformed or
transfected with a DNA construct which can be used for transcription or transcription and
translation of prenyltransferase. The recombinant cells which contain prenyltransferase are
also part of the present invention.

In a further aspect, the present invention relates to methods of using polynucleotide
and polypeptide sequences to modify the tocopherol content of host cells, particularly in host
plant cells. Plant cells having such a modified tocopherol content are also contemplated
herein.

The modified plants, seeds and oils obtained by the expression of the 
prenyltransferases are also considered part of the invention. 

BRIEF DESCRIPTION OF THE DRAWINGS 


Figure 1 provides an amino acid sequence alignment between ATPT2, ATPT3,
ATPT4, ATPT8, and ATPT12, performed using ClustalW.






WO 00/6339] 



PCT/US00/10368 



Figure 2 provides a schematic picture of the expression construct pCGN10800.
Figure 3 provides a schematic picture of the expression construct pCGN10801.
Figure 4 provides a schematic picture of the expression construct pCGN10803.
Figure 5 provides a schematic picture of the expression construct pCGN10806.
Figure 6 provides a schematic picture of the expression construct pCGN10807.
Figure 7 provides a schematic picture of the expression construct pCGN10808.
Figure 8 provides a schematic picture of the expression construct pCGN10809.
Figure 9 provides a schematic picture of the expression construct pCGN10810.
Figure 10 provides a schematic picture of the expression construct pCGN10811.
Figure 11 provides a schematic picture of the expression construct pCGN10812.
Figure 12 provides a schematic picture of the expression construct pCGN10813.
Figure 13 provides a schematic picture of the expression construct pCGN10814.
Figure 14 provides a schematic picture of the expression construct pCGN10815.
Figure 15 provides a schematic picture of the expression construct pCGN10816.
Figure 16 provides a schematic picture of the expression construct pCGN10817.
Figure 17 provides a schematic picture of the expression construct pCGN10819.
Figure 18 provides a schematic picture of the expression construct pCGN10824.
Figure 19 provides a schematic picture of the expression construct pCGN10825.
Figure 20 provides a schematic picture of the expression construct pCGN10826.

Figure 21 provides an amino acid sequence alignment using ClustalW between the
Synechocystis sequence knockouts.

Figure 22 provides the amino acid sequences of the ATPT2, ATPT3, ATPT4, ATPT8,
and ATPT12 proteins from Arabidopsis and the slr1736, slr0926, sll1899, slr0056,
and slr1518 amino acid sequences from Synechocystis.

Figure 23 provides the results of the enzymatic assay from preparations of wild type
Synechocystis strain 6803, and Synechocystis slr1736 knockout.

Figure 24 provides bar graphs of HPLC data obtained from seed extracts of transgenic
Arabidopsis containing pCGN10822, which provides for the expression of the ATPT2
sequence, in the sense orientation, from the napin promoter. Provided are graphs for alpha,
gamma, and delta tocopherols, as well as total tocopherol, for 22 transformed lines and
a nontransformed (wildtype) control.

Figure 25 provides a bar graph of HPLC analysis of seed extracts from Arabidopsis
plants transformed with pCGN10803 (35S-ATPT2, in the antisense orientation), pCGN10802
(line 1625, napin ATPT2 in the sense orientation), pCGN10809 (line 1627, 35S-ATPT3 in
the sense orientation), a nontransformed (wt) control, and an empty vector transformed
control.


DETAILED DESCRIPTION OF THE INVENTION 

The present invention provides, inter alia, compositions and methods for altering (for
example, increasing and decreasing) the tocopherol levels and/or modulating their ratios in
host cells. In particular, the present invention provides polynucleotides, polypeptides, and
methods of use thereof for the modulation of tocopherol content in host plant cells.

The present invention provides polynucleotide and polypeptide sequences involved in
the prenylation of straight chain and aromatic compounds. Straight chain prenyl transferases,
as used herein, comprise sequences which encode proteins involved in the prenylation of
straight chain compounds, including, but not limited to, geranyl geranyl pyrophosphate and
farnesyl pyrophosphate. Aromatic prenyl transferases, as used herein, comprise sequences
which encode proteins involved in the prenylation of aromatic compounds, including, but not
limited to, menaquinone, ubiquinone, chlorophyll, and homogentisic acid. The prenyl
transferase of the present invention preferably prenylates homogentisic acid.

The biosynthesis of α-tocopherol in higher plants involves condensation of
homogentisic acid and phytylpyrophosphate to form 2-methyl-6-phytylbenzoquinol that can,
by cyclization and subsequent methylations (Fiedler et al., 1982, Planta, 155: 511-515, Soll
et al., 1980, Arch. Biochem. Biophys. 204: 544-550, Marshall et al., 1985 Phytochem., 24:
1705-1711, all of which are herein incorporated by reference in their entirety), form various
tocopherols. The Arabidopsis pds2 mutant, identified and characterized by Norris et al.
(1995), is deficient in tocopherol and plastoquinone-9 accumulation. Further genetic and
biochemical analysis suggests that the protein encoded by PDS2 may be responsible for the
prenylation of homogentisic acid. This may be a rate limiting step in tocopherol biosynthesis,
and this gene has yet to be isolated. Thus, it is an aspect of the present invention to provide
polynucleotides and polypeptides involved in the prenylation of homogentisic acid.

Isolated Polynucleotides, Proteins, and Polypeptides 

A first aspect of the present invention relates to isolated prenyltransferase
polynucleotides. The polynucleotide sequences of the present invention include isolated
polynucleotides that encode the polypeptides of the invention having a deduced amino acid
sequence selected from the group of sequences set forth in the Sequence Listing and to other
polynucleotide sequences closely related to such sequences and variants thereof.

The invention provides a polynucleotide sequence identical over its entire length to
each coding sequence as set forth in the Sequence Listing. The invention also provides the
coding sequence for the mature polypeptide or a fragment thereof, as well as the coding
sequence for the mature polypeptide or a fragment thereof in a reading frame with other
coding sequences, such as those encoding a leader or secretory sequence, a pre-, pro-, or
prepro- protein sequence. The polynucleotide can also include non-coding sequences,
including for example, but not limited to, non-coding 5' and 3' sequences, such as the
transcribed, untranslated sequences, termination signals, ribosome binding sites, sequences
that stabilize mRNA, introns, polyadenylation signals, and additional coding sequence that
encodes additional amino acids. For example, a marker sequence can be included to facilitate
the purification of the fused polypeptide. Polynucleotides of the present invention also
include polynucleotides comprising a structural gene and the naturally associated sequences
that control gene expression.

The invention also includes polynucleotides of the formula:

X-(R1)n-(R2)-(R3)n-Y

wherein, at the 5' end, X is hydrogen, and at the 3' end, Y is hydrogen or a metal, R1 and R3
are any nucleic acid residue, n is an integer between 1 and 3000, preferably between 1 and
1000, and R2 is a nucleic acid sequence of the invention, particularly a nucleic acid sequence
selected from the group set forth in the Sequence Listing and preferably those of SEQ ID
NOs: 1, 3, 5, 7, 8, 10, 11, 13-16, 18, 23, 29, 36, and 38. In the formula, R2 is oriented so that
its 5' end residue is at the left, bound to R1, and its 3' end residue is at the right, bound to R3.
Any stretch of nucleic acid residues denoted by either R group, where n is greater than 1, may
be either a heteropolymer or a homopolymer, preferably a heteropolymer.

The invention also relates to variants of the polynucleotides described herein that
encode variants of the polypeptides of the invention. Variants that are fragments of the
polynucleotides of the invention can be used to synthesize full-length polynucleotides of the
invention. Preferred embodiments are polynucleotides encoding polypeptide variants wherein
5 to 10, 1 to 5, 1 to 3, 2, 1 or no amino acid residues of a polypeptide sequence of the
invention are substituted, added or deleted, in any combination. Particularly preferred are
substitutions, additions, and deletions that are silent such that they do not alter the properties
or activities of the polynucleotide or polypeptide.

Further preferred embodiments of the invention are polynucleotides that are at least
50%, 60%, or 70% identical over their entire length to a polynucleotide encoding a
polypeptide of the invention, and polynucleotides that are complementary to such
polynucleotides. More preferable are polynucleotides that comprise a region that is at least
80% identical over its entire length to a polynucleotide encoding a polypeptide of the
invention and polynucleotides that are complementary thereto. In this regard, polynucleotides
at least 90% identical over their entire length are particularly preferred, and those at least 95%
identical are especially preferred. Further, those with at least 97% identity are highly
preferred and those with at least 98% and 99% identity are particularly highly preferred, with
those at least 99% being the most highly preferred.
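
The identity thresholds in this paragraph can be illustrated with a small calculation. The following is a minimal sketch, not the method used for the sequences of the invention: it assumes the two sequences have already been aligned to equal length, counts gap columns as mismatches, and uses hypothetical function names.

```python
def percent_identity(aligned_a: str, aligned_b: str) -> float:
    """Percent of alignment columns holding the same residue in both sequences.

    Both inputs must already be aligned (equal length, '-' for gaps).
    Assumption: identity is taken over the full alignment length,
    so columns containing a gap count as mismatches.
    """
    if len(aligned_a) != len(aligned_b):
        raise ValueError("aligned sequences must have equal length")
    if not aligned_a:
        raise ValueError("empty alignment")
    matches = sum(
        1
        for x, y in zip(aligned_a.upper(), aligned_b.upper())
        if x == y and x != "-"
    )
    return 100.0 * matches / len(aligned_a)


# Thresholds named in the text, highest first.
TIERS = [99, 98, 97, 95, 90, 80, 70, 60, 50]


def highest_tier_met(identity: float):
    """Return the highest threshold (in percent) that the identity reaches, or None."""
    for tier in TIERS:
        if identity >= tier:
            return tier
    return None
```

For instance, two aligned ten-residue sequences that differ at one position are 90% identical and would fall in the "at least 90%" tier.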

Preferred embodiments are polynucleotides that encode polypeptides that retain 
substantially the same biological function or activity as the mature polypeptides encoded by 
the polynucleotides set forth in the Sequence Listing. 

The invention further relates to polynucleotides that hybridize to the above-described 
sequences. In particular, the invention relates to polynucleotides that hybridize under 
stringent conditions to the above-described polynucleotides. As used herein, the terms 
"stringent conditions" and "stringent hybridization conditions" mean that hybridization will 
generally occur if there is at least 95% and preferably at least 97% identity between the 
sequences. An example of stringent hybridization conditions is overnight incubation at 42°C 
in a solution comprising 50% formamide, 5x SSC (1x SSC = 150 mM NaCl, 15 mM trisodium citrate),
50 mM sodium phosphate (pH 7.6), 5x Denhardt's solution, 10% dextran sulfate, and 20 
micrograms/milliliter denatured, sheared salmon sperm DNA, followed by washing the 
hybridization support in 0.1x SSC at approximately 65°C. Other hybridization and wash
conditions are well known and are exemplified in Sambrook, et al., Molecular Cloning: A
Laboratory Manual, Second Edition, Cold Spring Harbor, NY (1989), particularly Chapter 11.
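
The example conditions above can be collected as plain data, together with the salt concentrations implied at each SSC strength. This is a sketch under one stated assumption: that 1x SSC is 150 mM NaCl and 15 mM trisodium citrate (the usual definition). The dictionary keys and function name are hypothetical.

```python
# Assumption: 1x SSC = 150 mM NaCl and 15 mM trisodium citrate.
SSC_1X_MM = {"NaCl": 150.0, "trisodium citrate": 15.0}


def ssc_concentrations(strength: float) -> dict:
    """Component concentrations (mM) for SSC at the given strength (e.g. 5 for 5x)."""
    return {name: conc * strength for name, conc in SSC_1X_MM.items()}


# The example stringent conditions from the text, recorded as plain data.
STRINGENT_EXAMPLE = {
    "hybridization_temp_c": 42,
    "formamide_percent": 50,
    "ssc_strength": 5,
    "sodium_phosphate_mm": 50,
    "sodium_phosphate_ph": 7.6,
    "denhardts_strength": 5,
    "dextran_sulfate_percent": 10,
    "salmon_sperm_dna_ug_per_ml": 20,
    "wash_ssc_strength": 0.1,
    "wash_temp_c": 65,
}

if __name__ == "__main__":
    print(ssc_concentrations(STRINGENT_EXAMPLE["ssc_strength"]))       # hybridization buffer
    print(ssc_concentrations(STRINGENT_EXAMPLE["wash_ssc_strength"]))  # wash buffer
```

On that assumption the hybridization solution contains 750 mM NaCl and the 0.1x wash about 15 mM NaCl, which is what makes the wash step the more stringent of the two.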

The invention also provides a polynucleotide consisting essentially of a
polynucleotide sequence obtainable by screening an appropriate library containing the
complete gene for a polynucleotide sequence set forth in the Sequence Listing under stringent
hybridization conditions with a probe having the sequence of said polynucleotide sequence or
a fragment thereof; and isolating said polynucleotide sequence. Fragments useful for
obtaining such a polynucleotide include, for example, probes and primers as described herein.

As discussed herein regarding polynucleotide assays of the invention, for example, 
polynucleotides of the invention can be used as a hybridization probe for RNA, cDNA, or 
genomic DNA to isolate full length cDNAs or genomic clones encoding a polypeptide and to 
isolate cDNA or genomic clones of other genes that have a high sequence similarity to a 
polynucleotide set forth in the Sequence Listing. Such probes will generally comprise at least 
15 bases. Preferably such probes will have at least 30 bases and can have at least 50 bases. 
Particularly preferred probes will have between 30 bases and 50 bases, inclusive. 

The coding region of each gene that comprises or is comprised by a polynucleotide 
sequence set forth in the Sequence Listing may be isolated by screening using a DNA 
sequence provided in the Sequence Listing to synthesize an oligonucleotide probe. A labeled 
oligonucleotide having a sequence complementary to that of a gene of the invention is then 
used to screen a library of cDNA, genomic DNA or mRNA to identify members of the library 
which hybridize to the probe. For example, synthetic oligonucleotides are prepared which
correspond to the prenyl transferase EST sequences. The oligonucleotides are used as primers
in polymerase chain reaction (PCR) techniques to obtain 5' and 3' terminal sequence of
prenyl transferase genes. Alternatively, where oligonucleotides of low degeneracy can be 
prepared from particular prenyl transferase peptides, such probes may be used directly to 
screen gene libraries for prenyltransferase gene sequences. In particular, screening of cDNA 
libraries in phage vectors is useful in such methods due to lower levels of background 
hybridization. 

Typically, a prenyltransferase sequence obtainable from the use of nucleic acid probes 
will show 60-70% sequence identity between the target prenyltransferase sequence and the 
encoding sequence used as a probe. However, lengthy sequences with as little as 50-60% 
sequence identity may also be obtained. The nucleic acid probes may be a lengthy fragment 
of the nucleic acid sequence, or may also be a shorter, oligonucleotide probe. When longer 
nucleic acid fragments are employed as probes (greater than about 100 bp), one may screen at 
lower stringencies in order to obtain sequences from the target sample which have 20-50% 
deviation (i.e., 50-80% sequence homology) from the sequences used as probe. 
Oligonucleotide probes can be considerably shorter than the entire nucleic acid sequence
encoding a prenyltransferase enzyme, but should be at least about 10, preferably at least
about 15, and more preferably at least about 20 nucleotides. A higher degree of sequence
identity is desired when shorter regions are used as opposed to longer regions. It may thus be
desirable to identify regions of highly conserved amino acid sequence to design
oligonucleotide probes for detecting and recovering other related prenyltransferase genes.
Shorter probes are often particularly useful for polymerase chain reactions (PCR), especially
when highly conserved sequences can be identified. (See, Gould, et al., PNAS USA (1989)
86:1934-1938.)
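
One way to locate candidate regions for probe design is to scan an existing multiple alignment for columns in which all sequences agree. The sketch below is illustrative only: the function names are hypothetical, the input is assumed to be a set of already aligned, equal-length sequences, and real probe design would also weigh codon degeneracy and melting temperature, which are not modeled here.

```python
def conserved_runs(aligned_seqs, min_len=1):
    """Return (start, end) column ranges where every aligned sequence agrees.

    aligned_seqs: equal-length strings, '-' marks a gap.
    Ranges are 0-based, end-exclusive, and at least min_len columns long.
    """
    if not aligned_seqs:
        return []
    length = len(aligned_seqs[0])
    if any(len(s) != length for s in aligned_seqs):
        raise ValueError("sequences must be aligned to equal length")

    runs = []
    start = None
    for col in range(length):
        column = {s[col] for s in aligned_seqs}
        conserved = len(column) == 1 and "-" not in column
        if conserved and start is None:
            start = col                      # a conserved run begins here
        elif not conserved and start is not None:
            if col - start >= min_len:
                runs.append((start, col))    # the run ended at the previous column
            start = None
    if start is not None and length - start >= min_len:
        runs.append((start, length))         # run extends to the end of the alignment
    return runs


def longest_conserved_run(aligned_seqs):
    """Longest fully conserved column range, or None if there is none."""
    runs = conserved_runs(aligned_seqs)
    return max(runs, key=lambda r: r[1] - r[0]) if runs else None
```

A conserved run of five amino acids corresponds to 15 nucleotides of coding sequence, which matches the preferred minimum oligonucleotide length mentioned above.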

Another aspect of the present invention relates to prenyltransferase polypeptides.
Such polypeptides include isolated polypeptides set forth in the Sequence Listing, as well as
polypeptides and fragments thereof, particularly those polypeptides which exhibit
prenyltransferase activity and also those polypeptides which have at least 50%, 60% or 70%
identity, preferably at least 80% identity, more preferably at least 90% identity, and most
preferably at least 95% identity to a polypeptide sequence selected from the group of
sequences set forth in the Sequence Listing, and also include portions of such polypeptides,
wherein such portion of the polypeptide preferably includes at least 30 amino acids and more
preferably includes at least 50 amino acids.

"Identity", as is well understood in the art, is a relationship between two or more 
polypeptide sequences or two or more polynucleotide sequences, as determined by comparing 
the sequences. In the art, "identity" also means the degree of sequence relatedness between 
polypeptide or polynucleotide sequences, as determined by the match between strings of such
sequences. "Identity" can be readily calculated by known methods including, but not limited
to, those described in Computational Molecular Biology, Lesk, A.M., ed., Oxford University
Press, New York (1988); Biocomputing: Informatics and Genome Projects, Smith, D.W., ed.,
Academic Press, New York, 1993; Computer Analysis of Sequence Data, Part I, Griffin,
A.M. and Griffin, H.G., eds., Humana Press, New Jersey (1994); Sequence Analysis in
Molecular Biology, von Heijne, G., Academic Press (1987); Sequence Analysis Primer,
Gribskov, M. and Devereux, J., eds., Stockton Press, New York (1991); and Carillo, H., and
Lipman, D., SIAM J Applied Math, 48:1073 (1988). Methods to determine identity are
designed to give the largest match between the sequences tested. Moreover, methods to
determine identity are codified in publicly available programs. Computer programs which
can be used to determine identity between two sequences include, but are not limited to, GCG
(Devereux, J., et al., Nucleic Acids Research 12(1):387 (1984)) and a suite of five BLAST
programs, three designed for nucleotide sequence queries (BLASTN, BLASTX, and
TBLASTX) and two designed for protein sequence queries (BLASTP and TBLASTN)
(Coulson, Trends in Biotechnology, 12: 76-80 (1994); Birren, et al., Genome Analysis, 1:
543-559 (1997)). The BLASTX program is publicly available from NCBI and other sources
(BLAST Manual, Altschul, S., et al., NCBI NLM NIH, Bethesda, MD 20894; Altschul, S., et
al., J. Mol. Biol., 215:403-410 (1990)). The well known Smith Waterman algorithm can also
be used to determine identity.
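
The division of the five BLAST programs by query type can be summarized as a small lookup table. The sketch below records the standard query and database types for each program; the class and function names are hypothetical and are not part of any BLAST distribution.

```python
from typing import NamedTuple


class BlastProgram(NamedTuple):
    query: str       # type of sequence supplied by the user
    database: str    # type of sequence held in the searched database
    translated: str  # which side, if any, is translated before comparison


BLAST_PROGRAMS = {
    "BLASTN":  BlastProgram("nucleotide", "nucleotide", "none"),
    "BLASTX":  BlastProgram("nucleotide", "protein", "query"),
    "TBLASTX": BlastProgram("nucleotide", "nucleotide", "both"),
    "BLASTP":  BlastProgram("protein", "protein", "none"),
    "TBLASTN": BlastProgram("protein", "nucleotide", "database"),
}


def programs_for_query(query_type: str):
    """Names of the BLAST programs that accept the given query type, sorted."""
    return sorted(
        name for name, prog in BLAST_PROGRAMS.items() if prog.query == query_type
    )
```

Calling `programs_for_query("nucleotide")` returns the three nucleotide-query programs named in the text, and `programs_for_query("protein")` returns the two protein-query programs.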

Parameters for polypeptide sequence comparison typically include the following:

Algorithm: Needleman and Wunsch, J. Mol. Biol. 48:443-453 (1970)
Comparison matrix: BLOSUM62 from Henikoff and Henikoff, Proc. Natl. Acad. Sci. USA
89:10915-10919 (1992)
Gap Penalty: 12
Gap Length Penalty: 4

A program which can be used with these parameters is publicly available as the "gap"
program from Genetics Computer Group, Madison, Wisconsin. The above parameters, along
with no penalty for end gaps, are the default parameters for peptide comparisons.

Parameters for polynucleotide sequence comparison include the following:

Algorithm: Needleman and Wunsch, J. Mol. Biol. 48:443-453 (1970)
Comparison matrix: matches = +10; mismatches = 0
Gap Penalty: 50
Gap Length Penalty: 3

A program which can be used with these parameters is publicly available as the "gap"
program from Genetics Computer Group, Madison, Wisconsin. The above parameters are the
default parameters for nucleic acid comparisons.
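
To show how the polynucleotide parameters enter a Needleman and Wunsch style comparison, the following is a minimal global alignment scorer with affine gap costs. It is a sketch, not a reimplementation of the "gap" program: it assumes that a gap of k residues costs the gap penalty plus k times the gap length penalty, it penalizes end gaps like internal gaps, and the function name is hypothetical. Polypeptide comparison would additionally need the BLOSUM62 table, which is not included here.

```python
NEG_INF = float("-inf")


def global_alignment_score(a, b, match=10, mismatch=0, gap_open=50, gap_extend=3):
    """Best global alignment score of a and b with affine gap costs.

    Assumed convention: a gap of k residues costs gap_open + k * gap_extend.
    End gaps are penalized like internal gaps in this sketch.
    """
    n, m = len(a), len(b)
    # M: a[i-1] aligned to b[j-1]; X: a[i-1] opposite a gap; Y: b[j-1] opposite a gap.
    M = [[NEG_INF] * (m + 1) for _ in range(n + 1)]
    X = [[NEG_INF] * (m + 1) for _ in range(n + 1)]
    Y = [[NEG_INF] * (m + 1) for _ in range(n + 1)]
    M[0][0] = 0
    for i in range(1, n + 1):
        X[i][0] = -(gap_open + i * gap_extend)
    for j in range(1, m + 1):
        Y[0][j] = -(gap_open + j * gap_extend)

    for i in range(1, n + 1):
        for j in range(1, m + 1):
            s = match if a[i - 1] == b[j - 1] else mismatch
            M[i][j] = s + max(M[i - 1][j - 1], X[i - 1][j - 1], Y[i - 1][j - 1])
            X[i][j] = max(
                M[i - 1][j] - (gap_open + gap_extend),  # open a new gap
                Y[i - 1][j] - (gap_open + gap_extend),  # switch gap side
                X[i - 1][j] - gap_extend,               # extend the current gap
            )
            Y[i][j] = max(
                M[i][j - 1] - (gap_open + gap_extend),
                X[i][j - 1] - (gap_open + gap_extend),
                Y[i][j - 1] - gap_extend,
            )
    return max(M[n][m], X[n][m], Y[n][m])
```

With these values a single one-base gap costs 53, so it is only worth opening when it recovers at least six additional matches; this is why the nucleic acid defaults favor ungapped alignments between closely related sequences.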

The invention also includes polypeptides of the formula:

X-(R1)n-(R2)-(R3)n-Y

wherein, at the amino terminus, X is hydrogen, and at the carboxyl terminus, Y is hydrogen or
a metal, R1 and R3 are any amino acid residue, n is an integer between 1 and 1000, and R2 is
an amino acid sequence of the invention, particularly an amino acid sequence selected from
the group set forth in the Sequence Listing and preferably those encoded by the sequences
provided in SEQ ID NOs: 2, 4, 6, 9, 12, 17, 19-22, 24-28, 30, 32-35, 37, and 39. In the
formula, R2 is oriented so that its amino terminal residue is at the left, bound to R1, and its
carboxy terminal residue is at the right, bound to R3. Any stretch of amino acid residues
denoted by either R group, where n is greater than 1, may be either a heteropolymer or a
homopolymer, preferably a heteropolymer.

Polypeptides of the present invention include isolated polypeptides encoded by a
polynucleotide comprising a sequence selected from the group of sequences contained in the
Sequence Listing set forth herein.

The polypeptides of the present invention can be mature protein or can be part of a
fusion protein.

Fragments and variants of the polypeptides are also considered to be a part of the
invention. A fragment is a variant polypeptide which has an amino acid sequence that is
entirely the same as part but not all of the amino acid sequence of the previously described
polypeptides. The fragments can be "free-standing" or comprised within a larger polypeptide
of which the fragment forms a part or a region, most preferably as a single continuous region.
Preferred fragments are biologically active fragments, which are those fragments that mediate
activities of the polypeptides of the invention, including those with similar activity or
improved activity or with a decreased activity. Also included are those fragments that are
antigenic or immunogenic in an animal, particularly a human.

Variants of the polypeptide also include polypeptides that vary from the sequences set
forth in the Sequence Listing by conservative amino acid substitutions, that is, substitution
of a residue by another with like characteristics. In general, such substitutions are among Ala,
Val, Leu and Ile; between Ser and Thr; between Asp and Glu; between Asn and Gln; between
Lys and Arg; or between Phe and Tyr. Particularly preferred are variants in which 5 to 10; 1
to 5; 1 to 3 or one amino acid(s) are substituted, deleted, or added, in any combination.
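
The exchange groups listed above translate directly into a lookup. The following sketch classifies substitutions between two equal-length sequences written as three-letter residue codes; only the groups named in the text are encoded, and the function names are hypothetical.

```python
# Exchange groups listed in the text (three-letter codes).
CONSERVATIVE_GROUPS = [
    {"Ala", "Val", "Leu", "Ile"},
    {"Ser", "Thr"},
    {"Asp", "Glu"},
    {"Asn", "Gln"},
    {"Lys", "Arg"},
    {"Phe", "Tyr"},
]


def is_conservative(original: str, replacement: str) -> bool:
    """True if the residues differ and both fall in the same exchange group."""
    if original == replacement:
        return False  # identical residues are not a substitution
    return any(original in g and replacement in g for g in CONSERVATIVE_GROUPS)


def count_substitutions(seq_a, seq_b):
    """Count (conservative, non_conservative) substitutions between two
    equal-length lists of three-letter residue codes."""
    if len(seq_a) != len(seq_b):
        raise ValueError("sequences must have equal length")
    cons = non_cons = 0
    for x, y in zip(seq_a, seq_b):
        if x == y:
            continue
        if is_conservative(x, y):
            cons += 1
        else:
            non_cons += 1
    return cons, non_cons
```

For example, replacing Leu with Ile or Ser with Thr counts as conservative, whereas replacing Lys with Glu does not, since those residues belong to different groups.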

Variants that are fragments of the polypeptides of the invention can be used to 
produce the corresponding full length polypeptide by peptide synthesis. Therefore, these 
variants can be used as intermediates for producing the full-length polypeptides of the 
invention. 

The polynucleotides and polypeptides of the invention can be used, for example, in
the transformation of host cells, such as plant host cells, as further discussed herein.

The invention also provides polynucleotides that encode a polypeptide that is a mature
protein plus additional amino or carboxyl-terminal amino acids, or amino acids within the
mature polypeptide (for example, when the mature form of the protein has more than one
polypeptide chain). Such sequences can, for example, play a role in the processing of a
protein from a precursor to a mature form, allow protein transport, shorten or lengthen protein
half-life, or facilitate manipulation of the protein in assays or production. It is contemplated
that cellular enzymes can be used to remove any additional amino acids from the mature
protein.

A precursor protein, having the mature form of the polypeptide fused to one or more 
prosequences may be an inactive form of the polypeptide. The inactive precursors generally 
are activated when the prosequences are removed. Some or all of the prosequences may be 
removed prior to activation. Such precursor proteins are generally called proproteins.

Plant Constructs and Methods of Use 

Of particular interest is the use of the nucleotide sequences in recombinant DNA 
constructs to direct the transcription or transcription and translation (expression) of the 
prenyltransferase sequences of the present invention in a host plant cell. The expression 
constructs generally comprise a promoter functional in a host plant cell operably linked to a 
nucleic acid sequence encoding a prenyltransferase of the present invention and a 
transcriptional termination region functional in a host plant cell. 

A first nucleic acid sequence is "operably linked" or "operably associated" with a 
second nucleic acid sequence when the sequences are so arranged that the first nucleic acid 
sequence affects the function of the second nucleic acid sequence. Preferably, the two
sequences are part of a single contiguous nucleic acid molecule and more preferably are 
adjacent. For example, a promoter is operably linked to a gene if the promoter regulates or 
mediates transcription of the gene in a cell. 

Those skilled in the art will recognize that there are a number of promoters which are 
functional in plant cells, and have been described in the literature. Chloroplast and plastid 
specific promoters, chloroplast or plastid functional promoters, and chloroplast or plastid 
operable promoters are also envisioned. 

One set of plant functional promoters are constitutive promoters such as the 
CaMV35S or FMV35S promoters that yield high levels of expression in most plant organs. 
Enhanced or duplicated versions of the CaMV35S and FMV35S promoters are useful in the 
practice of this invention (Odell, et al. (1985) Nature 313:810-812; Rogers, U.S. Patent
Number 5,378,619). In addition, it may also be preferred to bring about expression of the
prenyltransferase gene in specific tissues of the plant, such as leaf, stem, root, tuber, seed,
fruit, etc., and the promoter chosen should have the desired tissue and developmental
specificity.

Of particular interest is the expression of the nucleic acid sequences of the present
invention from transcription initiation regions which are preferentially expressed in a plant
seed tissue. Examples of such seed preferential transcription initiation sequences include
those sequences derived from sequences encoding plant storage protein genes or from genes
involved in fatty acid biosynthesis in oilseeds. Examples of such promoters include the 5'
regulatory regions from such genes as napin (Kridl et al., Seed Sci. Res. 1:209-219 (1991)),
phaseolin, zein, soybean trypsin inhibitor, ACP, stearoyl-ACP desaturase, soybean α' subunit
of β-conglycinin (soy 7s, (Chen et al., Proc. Natl. Acad. Sci., 83:8560-8564 (1986))) and
oleosin.

It may be advantageous to direct the localization of proteins conferring
prenyltransferase activity to a particular subcellular compartment, for example, to the
mitochondrion, endoplasmic reticulum, vacuoles, chloroplast or other plastidic compartment.
For example, where the genes of interest of the present invention will be targeted to plastids,
such as chloroplasts, for expression, the constructs will also employ the use of sequences to
direct the gene to the plastid. Such sequences are referred to herein as chloroplast transit
peptides (CTP) or plastid transit peptides (PTP). In this manner, where the gene of interest is
not directly inserted into the plastid, the expression construct will additionally contain a gene
encoding a transit peptide to direct the gene of interest to the plastid. The chloroplast transit
peptides may be derived from the gene of interest, or may be derived from a heterologous
sequence having a CTP. Such transit peptides are known in the art. See, for example, Von
Heijne et al. (1991) Plant Mol. Biol. Rep. 9:104-126; Clark et al. (1989) J. Biol. Chem.
264:17544-17550; della-Cioppa et al. (1987) Plant Physiol. 84:965-968; Romer et al. (1993)
Biochem. Biophys. Res. Commun. 196:1414-1421; and, Shah et al. (1986) Science
233:478-481.

Depending upon the intended use, the constructs may contain the nucleic acid
sequence which encodes the entire prenyltransferase protein, or a portion thereof. For
example, where antisense inhibition of a given prenyltransferase protein is desired, the entire
prenyltransferase sequence is not required. Furthermore, where prenyltransferase sequences
used in constructs are intended for use as probes, it may be advantageous to prepare
constructs containing only a particular portion of a prenyltransferase encoding sequence, for
example a sequence which is discovered to encode a highly conserved prenyltransferase
region.

The skilled artisan will recognize that there are various methods for the inhibition of
expression of endogenous sequences in a host cell. Such methods include, but are not limited
to, antisense suppression (Smith, et al. (1988) Nature 334:724-726), co-suppression (Napoli,
et al. (1989) Plant Cell 2:279-289), ribozymes (PCT Publication WO 97/10328), and
combinations of sense and antisense (Waterhouse, et al. (1998) Proc. Natl. Acad. Sci. USA
95:13959-13964). Methods for the suppression of endogenous sequences in a host cell
typically employ the transcription or transcription and translation of at least a portion of the
sequence to be suppressed. Such sequences may be homologous to coding as well as
non-coding regions of the endogenous sequence.

Regulatory transcript termination regions may be provided in plant expression
constructs of this invention as well. Transcript termination regions may be provided by the
DNA sequence encoding the prenyltransferase or a convenient transcription termination
region derived from a different gene source, for example, the transcript termination region
which is naturally associated with the transcript initiation region. The skilled artisan will
recognize that any convenient transcript termination region which is capable of terminating
transcription in a plant cell may be employed in the constructs of the present invention.

Alternatively, constructs may be prepared to direct the expression of the 
prenyltransferase sequences directly from the host plant cell plastid. Such constructs and 
methods are known in the art and are generally described, for example, in Svab, et al. (1990)
Proc. Natl. Acad. Sci. USA 87:8526-8530 and Svab and Maliga (1993) Proc. Natl. Acad. Sci.
USA 90:913-917 and in U.S. Patent Number 5,693,507.

The prenyltransferase constructs of the present invention can be used in
transformation methods with additional constructs providing for the expression of other
nucleic acid sequences encoding proteins involved in the production of tocopherols, or
tocopherol precursors such as homogentisic acid and/or phytylpyrophosphate. Nucleic acid
sequences encoding proteins involved in the production of homogentisic acid are known in
the art, and include, but are not limited to, 4-hydroxyphenylpyruvate dioxygenase (HPPD, EC
1.13.11.27) described, for example, by Garcia, et al. ((1999) Plant Physiol. 119(4):1507-1516),
mono or bifunctional tyrA (described, for example, by Xia, et al. (1992) J. Gen.
Microbiol. 138:1309-1316, and Hudson, et al. (1984) J. Mol. Biol. 180:1023-1051),
(Oxygenase, 4-hydroxyphenylpyruvate di- (9CI), 4-Hydroxyphenylpyruvate dioxygenase;
p-Hydroxyphenylpyruvate dioxygenase; p-Hydroxyphenylpyruvate hydroxylase;
p-Hydroxyphenylpyruvate oxidase; p-Hydroxyphenylpyruvic acid hydroxylase;
p-Hydroxyphenylpyruvic hydroxylase; p-Hydroxyphenylpyruvic oxidase),
4-hydroxyphenylacetate, NAD(P)H:oxygen oxidoreductase (1-hydroxylating);
4-hydroxyphenylacetate 1-monooxygenase, and the like. In addition, constructs for the
expression of nucleic acid sequences encoding proteins involved in the production of
phytylpyrophosphate can also be employed with the prenyltransferase constructs of the
present invention. Nucleic acid sequences encoding proteins involved in the production of
phytylpyrophosphate are known in the art, and include, but are not limited to,
geranylgeranylpyrophosphate synthase (GGPPS), geranylgeranylpyrophosphate reductase
(GGH), 1-deoxyxylulose-5-phosphate synthase, 1-deoxy-D-xylulose-5-phosphate
reductoisomerase, 4-diphosphocytidyl-2-C-methylerythritol synthase, and isopentenyl
pyrophosphate isomerase.

The prenyltransferase sequences of the present invention find use in the preparation of
transformation constructs having a second expression cassette for the expression of additional
sequences involved in tocopherol biosynthesis. Additional tocopherol biosynthesis
sequences of interest in the present invention include, but are not limited to, gamma-tocopherol
methyltransferase (Shintani, et al. (1998) Science 282(5396):2098-2100), tocopherol cyclase,
and tocopherol methyltransferase.

A plant cell, tissue, organ, or plant into which the recombinant DNA constructs 
containing the expression constructs have been introduced is considered transformed, 
transfected, or transgenic. A transgenic or transformed cell or plant also includes progeny of
the cell or plant and progeny produced from a breeding program employing such a transgenic 
plant as a parent in a cross and exhibiting an altered phenotype resulting from the presence of 
a prenyltransferase nucleic acid sequence. 

Plant expression or transcription constructs having a prenyltransferase as the DNA
sequence of interest for increased or decreased expression thereof may be employed with a
wide variety of plant life, particularly, plant life involved in the production of vegetable oils
for edible and industrial uses. Particularly preferred plants for use in the methods of the
present invention include, but are not limited to: Acacia, alfalfa, aneth, apple, apricot,
artichoke, arugula, asparagus, avocado, banana, barley, beans, beet, blackberry, blueberry,
broccoli, brussels sprouts, cabbage, canola, cantaloupe, carrot, cassava, cauliflower, celery,
cherry, chicory, cilantro, citrus, Clementines, coffee, corn, cotton, cucumber, Douglas fir,
eggplant, endive, escarole, eucalyptus, fennel, figs, garlic, gourd, grape, grapefruit, honey
dew, jicama, kiwifruit, lettuce, leeks, lemon, lime, Loblolly pine, mango, melon, mushroom,
nectarine, nut, oat, oil palm, oil seed rape, okra, onion, orange, an ornamental plant, papaya,
parsley, pea, peach, peanut, pear, pepper, persimmon, pine, pineapple, plantain, plum,
pomegranate, poplar, potato, pumpkin, quince, radiata pine, radicchio, radish, raspberry, rice,
rye, sorghum, Southern pine, soybean, spinach, squash, strawberry, sugarbeet, sugarcane,
sunflower, sweet potato, sweetgum, tangerine, tea, tobacco, tomato, triticale, turf, turnip, a
vine, watermelon, wheat, yams, and zucchini.

Most especially preferred are temperate oilseed crops. Temperate oilseed crops of interest
include, but are not limited to, rapeseed (Canola and High Erucic Acid varieties), sunflower,
safflower, cotton, soybean, peanut, coconut and oil palms, and corn. Depending on the
method for introducing the recombinant constructs into the host cell, other DNA sequences
may be required. Importantly, this invention is applicable to dicotyledonous and
monocotyledonous species alike and will be readily applicable to new and/or improved
transformation and regulation techniques.

Of particular interest is the use of prenyltransferase constructs in plants to produce
plants or plant parts, including, but not limited to, leaves, stems, roots, reproductive tissues,
and seeds, with a modified content of tocopherols in plant parts having transformed plant cells.

For immunological screening, antibodies to the protein can be prepared by injecting
rabbits or mice with the purified protein or portion thereof, such methods of preparing
antibodies being well known to those in the art. Either monoclonal or polyclonal antibodies
can be produced, although typically polyclonal antibodies are more useful for gene isolation.
Western analysis may be conducted to determine that a related protein is present in a crude
extract of the desired plant species, as determined by cross-reaction with the antibodies to the
encoded proteins. When cross-reactivity is observed, genes encoding the related proteins are
isolated by screening expression libraries representing the desired plant species. Expression
libraries can be constructed in a variety of commercially available vectors, including lambda
gt11, as described in Sambrook, et al. (Molecular Cloning: A Laboratory Manual, Second
Edition (1989) Cold Spring Harbor Laboratory, Cold Spring Harbor, New York).

To confirm the activity and specificity of the proteins encoded by the identified
nucleic acid sequences as prenyltransferase enzymes, in vitro assays are performed in insect
cell cultures using baculovirus expression systems. Such baculovirus expression systems are
known in the art and are described by Lee, et al., U.S. Patent Number 5,348,886, the entirety
of which is herein incorporated by reference.

In addition, other expression constructs may be prepared to assay for protein activity
utilizing different expression systems. Such expression constructs are transformed into a yeast
or prokaryotic host and assayed for prenyltransferase activity. Such expression systems are
known in the art and are readily available through commercial sources.

In addition to the sequences described in the present invention, DNA coding sequences 
useful in the present invention can be derived from algae, fungi, bacteria, mammalian 
sources, plants, etc. Homology searches in existing databases using signature sequences 
corresponding to conserved nucleotide and amino acid sequences of prenyltransferase can be 
employed to isolate equivalent, related genes from other sources such as plants and 
microorganisms. Searches in EST databases can also be employed. Furthermore, the use of 
DNA sequences encoding enzymes functionally and enzymatically equivalent to those disclosed
herein, wherein such DNA sequences are degenerate equivalents of the nucleic acid
sequences disclosed herein in accordance with the degeneracy of the genetic code, is also
encompassed by the present invention. Demonstration of the functionality of coding
sequences identified by any of these methods can be carried out by complementation of
mutants of appropriate organisms, such as Synechocystis, Shewanella, yeast, Pseudomonas,
Rhodobacteria, etc., that lack specific biochemical reactions, or that have been mutated. The
sequences of the DNA coding regions can be optimized by gene resynthesis, based on codon
usage, for maximum expression in particular hosts.

For the alteration of tocopherol production in a host cell, a second expression 
construct can be used in accordance with the present invention. For example, the 
prenyltransferase expression construct can be introduced into a host cell in conjunction with 
a second expression construct having a nucleotide sequence for a protein involved in 
tocopherol biosynthesis. 

The method of transformation in obtaining such transgenic plants is not critical to the
instant invention, and various methods of plant transformation are currently available.
Furthermore, as newer methods become available to transform crops, they may also be
directly applied hereunder. For example, many plant species naturally susceptible to
Agrobacterium infection may be successfully transformed via tripartite or binary vector
methods of Agrobacterium mediated transformation. In many instances, it will be desirable
to have the construct bordered on one or both sides by T-DNA, particularly having the left
and right borders, more particularly the right border. This is particularly useful when the
construct uses A. tumefaciens or A. rhizogenes as a mode for transformation, although the
T-DNA borders may find use with other modes of transformation. In addition, techniques of
microinjection, DNA particle bombardment, and electroporation have been developed which
allow for the transformation of various monocot and dicot plant species.

Normally, included with the DNA construct will be a structural gene having the 
necessary regulatory regions for expression in a host and providing for selection of 
transformant cells. The gene may provide for resistance to a cytotoxic agent, e.g. antibiotic, 
heavy metal, toxin, etc., complementation providing prototrophy to an auxotrophic host, viral 
immunity or the like. Depending upon the number of different host species into which the
expression construct or components thereof are introduced, one or more markers may be
employed, where different conditions for selection are used for the different hosts.

Where Agrobacterium is used for plant cell transformation, a vector may be used 
which may be introduced into the Agrobacterium host for homologous recombination with T- 
DNA or the Ti- or Ri-plasmid present in the Agrobacterium host. The Ti- or Ri-plasmid 
containing the T-DNA for recombination may be armed (capable of causing gall formation) 
or disarmed (incapable of causing gall formation), the latter being permissible, so long as the 
vir genes are present in the transformed Agrobacterium host. The armed plasmid can give a 
mixture of normal plant cells and gall. 

In some instances where Agrobacterium is used as the vehicle for transforming host 
plant cells, the expression or transcription construct bordered by the T-DNA border region(s) 
will be inserted into a broad host range vector capable of replication in E. coli and 
Agrobacterium, there being broad host range vectors described in the literature. Commonly 
used is pRK2 or derivatives thereof. See, for example, Ditta, et al. (Proc. Natl. Acad. Sci.,
U.S.A. (1980) 77:7347-7351) and EPA 0 120 515, which are incorporated herein by reference.
Alternatively, one may insert the sequences to be expressed in plant cells into a vector
containing separate replication sequences, one of which stabilizes the vector in E. coli, and
the other in Agrobacterium. See, for example, McBride, et al. (Plant Mol. Biol. (1990)
14:269-276), wherein the pRiHRI (Jouanin, et al., Mol. Gen. Genet. (1985) 201:370-374)
origin of replication is utilized and provides for added stability of the plant expression vectors
in host Agrobacterium cells.

Included with the expression construct and the T-DNA will be one or more markers, 
which allow for selection of transformed Agrobacterium and transformed plant cells. A 
number of markers have been developed for use with plant cells, such as resistance to 
chloramphenicol, kanamycin, the aminoglycoside G418, hygromycin, or the like. The 
particular marker employed is not essential to this invention, one or another marker being
preferred depending on the particular host and the manner of construction. 

For transformation of plant cells using Agrobacterium, explants may be combined and
incubated with the transformed Agrobacterium for sufficient time for transformation, the
bacteria killed, and the plant cells cultured in an appropriate selective medium. Once callus
forms, shoot formation can be encouraged by employing the appropriate plant hormones in
accordance with known methods and the shoots transferred to rooting medium for
regeneration of plants. The plants may then be grown to seed and the seed used to establish
repetitive generations and for isolation of vegetable oils.

There are several possible ways to obtain the plant cells of this invention which
contain multiple expression constructs. Any means for producing a plant comprising a
construct having a DNA sequence encoding the expression construct of the present invention,
and at least one other construct having another DNA sequence encoding an enzyme, is
encompassed by the present invention. For example, the expression construct can be used to
transform a plant at the same time as the second construct either by inclusion of both
expression constructs in a single transformation vector or by using separate vectors, each of
which expresses desired genes. The second construct can be introduced into a plant which has
already been transformed with the prenyltransferase expression construct, or alternatively,
transformed plants, one expressing the prenyltransferase construct and one expressing the
second construct, can be crossed to bring the constructs together in the same plant.

The nucleic acid sequences of the present invention can be used in constructs to
provide for the expression of the sequence in a variety of host cells, both prokaryotic
and eukaryotic. Host cells of the present invention preferably include monocotyledonous and
dicotyledonous plant cells.

In general, the skilled artisan is familiar with the standard resource materials which
describe specific conditions and procedures for the construction, manipulation and isolation
of macromolecules (e.g., DNA molecules, plasmids, etc.), generation of recombinant
organisms and the screening and isolating of clones (see, for example, Sambrook et al.,
Molecular Cloning: A Laboratory Manual, Cold Spring Harbor Press (1989); Maliga et al.,
Methods in Plant Molecular Biology, Cold Spring Harbor Press (1995), the entirety of which
is herein incorporated by reference; Birren et al., Genome Analysis: Analyzing DNA, 1, Cold
Spring Harbor, New York, the entirety of which is herein incorporated by reference).

Methods for the expression of sequences in insect host cells are known in the art.
Baculovirus expression vectors are recombinant insect viruses in which the coding sequence
for a chosen foreign gene has been inserted behind a baculovirus promoter in place of the
viral gene, e.g., polyhedrin (Smith and Summers, U.S. Pat. No. 4,745,051, the entirety of
which is incorporated herein by reference). Baculovirus expression vectors are known in the
art, and are described, for example, in Doerfler, Curr. Top. Microbiol. Immunol. 131:51-68
(1986); Luckow and Summers, Bio/Technology 6:47-55 (1988a); Miller, Annual Review of
Microbiol. 42:177-199 (1988); Summers, Curr. Comm. Molecular Biology, Cold Spring
Harbor Press, Cold Spring Harbor, N.Y. (1988); and Summers and Smith, A Manual of Methods
for Baculovirus Vectors and Insect Cell Culture Procedures, Texas Ag. Exper. Station
Bulletin No. 1555 (1988), the entireties of which are herein incorporated by reference.

Methods for the expression of a nucleic acid sequence of interest in a fungal host cell
are known in the art. The fungal host cell may, for example, be a yeast cell or a filamentous
fungal cell. Methods for the expression of DNA sequences of interest in yeast cells are
generally described in "Guide to yeast genetics and molecular biology", Guthrie and Fink,
eds., Methods in Enzymology, Academic Press, Inc., Vol. 194 (1991) and "Gene expression
technology", Goeddel ed., Methods in Enzymology, Academic Press, Inc., Vol. 185 (1991).

Mammalian cell lines available as hosts for expression are known in the art and
include many immortalized cell lines available from the American Type Culture Collection
(ATCC, Manassas, VA), such as HeLa cells, Chinese hamster ovary (CHO) cells, baby
hamster kidney (BHK) cells and a number of other cell lines. Suitable promoters for
mammalian cells are also known in the art and include, but are not limited to, viral promoters
such as that from Simian Virus 40 (SV40) (Fiers et al., Nature 273:113 (1978), the entirety of
which is herein incorporated by reference), Rous sarcoma virus (RSV), adenovirus (ADV)
and bovine papilloma virus (BPV). Mammalian cells may also require terminator sequences
and poly-A addition sequences. Enhancer sequences which increase expression may also be
included and sequences which promote amplification of the gene may also be desirable (for
example methotrexate resistance genes).

Vectors suitable for replication in mammalian cells are well known in the art, and may
include viral replicons, or sequences which ensure integration of the appropriate sequences
encoding epitopes into the host genome. Plasmid vectors that greatly facilitate the
construction of recombinant viruses have been described (see, for example, Mackett et al., J.
Virol. 49:857 (1984); Chakrabarti et al., Mol. Cell Biol. 5:3403 (1985); Moss, In: Gene
Transfer Vectors For Mammalian Cells (Miller and Calos, eds., Cold Spring Harbor
Laboratory, N.Y., p. 10, (1987)); all of which are herein incorporated by reference in their
entirety).

The invention now being generally described, it will be more readily understood by
reference to the following examples which are included for purposes of illustration only and
are not intended to limit the present invention.

EXAMPLES 

Example 1: Identification of Prenyltransferase Sequences

PSI-BLAST (Altschul, et al. (1997) Nucleic Acids Res. 25:3389-3402) profiles were
generated for both the straight chain and aromatic classes of prenyltransferases. To generate
the straight chain profile, a prenyltransferase from Porphyra purpurea (Genbank accession
1709766) was used as a query against the NCBI non-redundant protein database. The E. coli
enzyme involved in the formation of ubiquinone, ubiA (Genbank accession 1790473), was
used as a starting sequence to generate the aromatic prenyltransferase profile. These profiles
were used to search public and proprietary DNA and protein databases. In Arabidopsis, seven
putative prenyltransferases of the straight-chain class were identified, ATPT1 (SEQ ID
NO:9), ATPT7 (SEQ ID NO:10), ATPT8 (SEQ ID NO:11), ATPT9 (SEQ ID NO:13),
ATPT10 (SEQ ID NO:14), ATPT11 (SEQ ID NO:15), and ATPT12 (SEQ ID NO:16), and
five were identified of the aromatic class, ATPT2 (SEQ ID NO:1), ATPT3 (SEQ ID NO:3),
ATPT4 (SEQ ID NO:5), ATPT5 (SEQ ID NO:7), and ATPT6 (SEQ ID NO:8). Additional
prenyltransferase sequences from other plants related to the aromatic class of
prenyltransferases, such as soy (SEQ ID NOs:19-23; the deduced amino acid sequence of
SEQ ID NO:23 is provided in SEQ ID NO:24) and maize (SEQ ID NOs:25-29, and 31), are
also identified. The deduced amino acid sequence of ZMPT5 (SEQ ID NO:29) is provided in
SEQ ID NO:30.

Searches are performed on a Silicon Graphics Unix computer using additional
Bioaccellerator hardware and GenWeb software supplied by Compugen Ltd. This software
and hardware enables the use of the Smith-Waterman algorithm in searching DNA and
protein databases using profiles as queries. The program used to query protein databases is
profilesearch. This is a search where the query is not a single sequence but a profile based on
a multiple alignment of amino acid or nucleic acid sequences. The profile is used to query a
sequence data set, i.e., a sequence database. The profile contains all the pertinent information
for scoring each position in a sequence, in effect replacing the "scoring matrix" used for the
standard query searches. The program used to query nucleotide databases with a protein
profile is tprofilesearch. Tprofilesearch searches nucleic acid databases using an amino acid
profile query. As the search is running, sequences in the database are translated to amino acid
sequences in six reading frames. The output file for tprofilesearch is identical to the output
file for profilesearch except for an additional column that indicates the frame in which the
best alignment occurred.
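
By way of illustration only, the six-frame translation step described above may be sketched as follows. This is a minimal sketch using the standard genetic code and is not the tprofilesearch implementation; the function names (reverse_complement, translate, six_frame_translations) and the frame numbering (+1 to +3 for the given strand, -1 to -3 for its reverse complement) are assumptions made for this example.

```python
# Minimal sketch of six-frame translation (illustrative; not the
# tprofilesearch implementation). Uses the standard genetic code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}
COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(dna: str) -> str:
    """Return the reverse complement of a DNA string."""
    return dna.upper().translate(COMPLEMENT)[::-1]


def translate(dna: str) -> str:
    """Translate complete codons from position 0; unknown codons become 'X'."""
    dna = dna.upper()
    codons = (dna[i:i + 3] for i in range(0, len(dna) - 2, 3))
    return "".join(CODON_TABLE.get(codon, "X") for codon in codons)


def six_frame_translations(dna: str) -> dict:
    """Map frame labels (+1..+3, -1..-3) to translated amino acid strings."""
    frames = {}
    reverse = reverse_complement(dna)
    for offset in range(3):
        frames[offset + 1] = translate(dna[offset:])
        frames[-(offset + 1)] = translate(reverse[offset:])
    return frames


frames = six_frame_translations("ATGGCCTAA")
print(frames[1])  # forward frame 1
```

A search program of this kind would score each of the six translations against the profile and report the frame giving the best alignment.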

The Smith-Waterman algorithm, (Smith and Waterman (1981) supra), is used to 
search for similarities between one sequence from the query and a group of sequences 
contained in the database. E score values as well as other sequence information, such as 
conserved peptide sequences are used to identify related sequences. 
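
For illustration, the local alignment scoring at the core of the Smith-Waterman algorithm may be sketched as follows. This is a simplified sketch, not the Compugen implementation: it compares two plain sequences rather than a profile against a sequence, and the scoring values (match 2, mismatch -1, linear gap -1) and the function name smith_waterman_score are assumptions chosen for the example.

```python
def smith_waterman_score(a: str, b: str, match: int = 2,
                         mismatch: int = -1, gap: int = -1) -> int:
    """Return the best local alignment score between sequences a and b.

    Illustrative scoring only: a fixed match/mismatch score and a linear
    gap penalty stand in for a substitution matrix or profile.
    """
    rows, cols = len(a) + 1, len(b) + 1
    score = [[0] * cols for _ in range(rows)]
    best = 0
    for i in range(1, rows):
        for j in range(1, cols):
            pair = match if a[i - 1] == b[j - 1] else mismatch
            diagonal = score[i - 1][j - 1] + pair
            up = score[i - 1][j] + gap
            left = score[i][j - 1] + gap
            # Local alignment: a cell never drops below zero.
            score[i][j] = max(0, diagonal, up, left)
            best = max(best, score[i][j])
    return best


print(smith_waterman_score("TTACGTTT", "GGACGGG"))
```

In a profile search, the fixed match and mismatch values would be replaced by the position-specific scores carried in the profile.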

To obtain the entire coding region corresponding to the Arabidopsis prenyltransferase
sequences, synthetic oligonucleotide primers are designed to amplify the 5' and 3' ends of
partial cDNA clones containing prenyltransferase sequences. Primers are designed according
to the respective Arabidopsis prenyltransferase sequences and used in Rapid Amplification of
cDNA Ends (RACE) reactions (Frohman et al. (1988) Proc. Natl. Acad. Sci. USA 85:8998-9002)
using the Marathon cDNA amplification kit (Clontech Laboratories Inc, Palo Alto,
CA).

Additional BLAST searches are performed using the ATPT2 sequence, a sequence in
the class of aromatic prenyltransferases. Additional sequences are identified in soybean
libraries that are similar to the ATPT2 sequence. The additional soybean sequence
demonstrates 80% identity and 91% similarity at the amino acid level.
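
As a hedged illustration of how a percent identity figure of this kind may be computed from a pairwise alignment, consider the following sketch. The convention used here (identical residues divided by the number of aligned, ungapped columns) is an assumption; other denominators are in common use, and the values reported herein may have been calculated differently. Percent similarity would additionally require a substitution matrix to decide which non-identical residue pairs count as similar. The function name and the toy sequences are hypothetical.

```python
def percent_identity(aligned_a: str, aligned_b: str) -> float:
    """Percent identity over ungapped columns of a pairwise alignment.

    Both inputs must already be aligned (equal length, '-' for gaps).
    """
    if len(aligned_a) != len(aligned_b):
        raise ValueError("aligned sequences must have equal length")
    compared = 0
    identical = 0
    for x, y in zip(aligned_a.upper(), aligned_b.upper()):
        if x == "-" or y == "-":
            continue  # gapped columns are excluded from the comparison
        compared += 1
        if x == y:
            identical += 1
    return 100.0 * identical / compared if compared else 0.0


# Toy aligned peptides (hypothetical, not taken from the sequence listing).
print(percent_identity("MKT-LV", "MRTALV"))
```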

Amino acid sequence alignments between ATPT2 (SEQ ID NO:2), ATPT3 (SEQ ID 
NO:4), ATPT4 (SEQ ID NO:6), ATPT8 (SEQ ID NO:12), and ATPT12 (SEQ ID NO: 17) are 
performed using ClustalW (Figure 1), and the percent identity and similarities are provided in 
Table 1 below. 


Table 1: 

ATPT2 ATPT3 ATPT4 ATPT8 ATPT12 

Example 2: Preparation of Expression Constructs

A plasmid containing the napin cassette derived from pCGN3223 (described in USPN
5,639,790, the entirety of which is incorporated herein by reference) was modified to make it
more useful for cloning large DNA fragments containing multiple restriction sites, and to
allow the cloning of multiple napin fusion genes into plant binary transformation vectors. An
adapter comprised of the self annealed oligonucleotide of sequence
CGCGATTTAAATGGCGCGCCCTGCAGGCGGCCGCCTGCAGGGCGCGCCATTTAAAT
(SEQ ID NO:40) was ligated into the cloning vector pBC SK+ (Stratagene) after
digestion with the restriction endonuclease BssHII to construct vector pCGN7765. Plasmids
pCGN3223 and pCGN7765 were digested with NotI and ligated together. The resultant
vector, pCGN7770, contains the pCGN7765 backbone with the napin seed specific
expression cassette from pCGN3223.
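
The self-annealing property of the adapter (SEQ ID NO:40) can be checked computationally. The following is an illustrative sketch only: it tests whether the oligonucleotide, apart from its first four bases, is its own reverse complement, in which case two copies anneal to leave 5'-CGCG overhangs of the kind produced by BssHII digestion. The function names and the dictionary of recognition sequences are assumptions made for this example; the recognition sequences listed are those generally known for the named enzymes.

```python
COMPLEMENT = str.maketrans("ACGT", "TGCA")

# Adapter sequence as given for SEQ ID NO:40.
ADAPTER = ("CGCGATTTAAATGGCGCGCCCTGCAGGCGGCCGCCTGCAGGGCGCGCCATTTAA"
           "AT")

# Commonly cited recognition sequences (assumed here for illustration).
SITES = {
    "SwaI": "ATTTAAAT",
    "AscI": "GGCGCGCC",
    "Sse8387I": "CCTGCAGG",
    "NotI": "GCGGCCGC",
}


def reverse_complement(dna: str) -> str:
    """Return the reverse complement of a DNA string."""
    return dna.upper().translate(COMPLEMENT)[::-1]


def self_annealed_overhang(oligo: str, overhang_length: int = 4):
    """Return the 5' overhang if the remainder is self-complementary, else None."""
    core = oligo[overhang_length:]
    if core == reverse_complement(core):
        return oligo[:overhang_length]
    return None


print(self_annealed_overhang(ADAPTER))
print([name for name, site in SITES.items() if site in ADAPTER])
```

Under these assumptions the check reports a CGCG overhang and finds all four listed recognition sequences within the adapter.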

The cloning cassette, pCGN7787, contains essentially the same regulatory elements as
pCGN7770, with the exception that the napin regulatory regions of pCGN7770 have been
replaced with the double CaMV 35S promoter and the tml polyadenylation and
transcriptional termination region.

A binary vector for plant transformation, pCGN5139, was constructed from
pCGN1558 (McBride and Summerfelt, (1990) Plant Molecular Biology, 14:269-276). The
polylinker of pCGN1558 was replaced as a HindIII/Asp718 fragment with a polylinker
containing unique restriction endonuclease sites, AscI, PacI, XbaI, SwaI, BamHI, and NotI.
The Asp718 and HindIII restriction endonuclease sites are retained in pCGN5139.

A series of turbo binary vectors are constructed to allow for the rapid cloning of DNA
sequences into binary vectors containing transcriptional initiation regions (promoters) and
transcriptional termination regions.

The plasmid pCGN8618 was constructed by ligating oligonucleotides 5'-
TCGAGGATCCGCGGCCGCAAGCTTCCTGCAGG-3' (SEQ ID NO:41) and 5'-
TCGACCTGCAGGAAGCTTGCGGCCGCGGATCC-3' (SEQ ID NO:42) into SalI/XhoI-
digested pCGN7770. A fragment containing the napin promoter, polylinker and napin 3'
region was excised from pCGN8618 by digestion with Asp718I; the fragment was blunt-
ended by filling in the 5' overhangs with Klenow fragment then ligated into pCGN5139 that
had been digested with Asp718I and HindIII and blunt-ended by filling in the 5' overhangs
with Klenow fragment. A plasmid containing the insert oriented so that the napin promoter
was closest to the blunted Asp718I site of pCGN5139 and the napin 3' was closest to the
blunted HindIII site was subjected to sequence analysis to confirm both the insert orientation
and the integrity of cloning junctions. The resulting plasmid was designated pCGN8622.
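
As an illustrative check of the oligonucleotide pair used above (SEQ ID NO:41 and SEQ ID NO:42), the following sketch confirms that the two oligonucleotides pair over a common core and leave four-base 5' overhangs at each end, and lists the restriction sites carried by the polylinker. The function names and the dictionary of recognition sequences are assumptions made for this example; the recognition sequences are those generally known for the named enzymes.

```python
COMPLEMENT = str.maketrans("ACGT", "TGCA")

OLIGO_41 = "TCGAGGATCCGCGGCCGCAAGCTTCCTGCAGG"  # SEQ ID NO:41
OLIGO_42 = "TCGACCTGCAGGAAGCTTGCGGCCGCGGATCC"  # SEQ ID NO:42

# Commonly cited recognition sequences (assumed here for illustration).
SITES = {
    "BamHI": "GGATCC",
    "NotI": "GCGGCCGC",
    "HindIII": "AAGCTT",
    "Sse8387I": "CCTGCAGG",
}


def reverse_complement(dna: str) -> str:
    """Return the reverse complement of a DNA string."""
    return dna.upper().translate(COMPLEMENT)[::-1]


def duplex_overhangs(top: str, bottom: str, overhang_length: int = 4):
    """Return both 5' overhangs if the oligo cores are complementary, else None."""
    if top[overhang_length:] != reverse_complement(bottom[overhang_length:]):
        return None
    return top[:overhang_length], bottom[:overhang_length]


def site_order(sequence: str, sites: dict) -> list:
    """List site names in the order of their first occurrence in sequence."""
    found = [(sequence.find(site), name)
             for name, site in sites.items() if site in sequence]
    return [name for _, name in sorted(found)]


print(duplex_overhangs(OLIGO_41, OLIGO_42))
print(site_order(OLIGO_41, SITES))
```

Both overhangs are 5'-TCGA, which is the overhang generally understood to be left by SalI and by XhoI digestion, consistent with ligation into the SalI/XhoI-digested vector.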

The plasmid pCGN8619 was constructed by ligating oligonucleotides 5'-
TCGACCTGCAGGAAGCTTGCGGCCGCGGATCC-3' (SEQ ID NO:43) and 5'-
TCGAGGATCCGCGGCCGCAAGCTTCCTGCAGG-3' (SEQ ID NO:44) into SalI/XhoI-
digested pCGN7770. A fragment containing the napin promoter, polylinker and napin 3'
region was removed from pCGN8619 by digestion with Asp718I; the fragment was blunt-
ended by filling in the 5' overhangs with Klenow fragment then ligated into pCGN5139 that
had been digested with Asp718I and HindIII and blunt-ended by filling in the 5' overhangs
with Klenow fragment. A plasmid containing the insert oriented so that the napin promoter
was closest to the blunted Asp718I site of pCGN5139 and the napin 3' was closest to the
blunted HindIII site was subjected to sequence analysis to confirm both the insert orientation
and the integrity of cloning junctions. The resulting plasmid was designated pCGN8623.






WO 00/63391 



PCT/US00/10368 



The plasmid pCGN8620 was constructed by ligating oligonucleotides 5'-
TCGAGGATCCGCGGCCGCAAGCTTCCTGCAGGAGCT-3' (SEQ ID NO:45) and 5'-
CCTGCAGGAAGCTTGCGGCCGCGGATCC-3' (SEQ ID NO:46) into SalI/SacI-digested
pCGN7787. A fragment containing the d35S promoter, polylinker and tml 3' region was
removed from pCGN8620 by complete digestion with Asp718I and partial digestion with
NotI. The fragment was blunt-ended by filling in the 5' overhangs with Klenow fragment
then ligated into pCGN5139 that had been digested with Asp718I and HindIII and blunt-
ended by filling in the 5' overhangs with Klenow fragment. A plasmid containing the insert
oriented so that the d35S promoter was closest to the blunted Asp718I site of pCGN5139 and
the tml 3' was closest to the blunted HindIII site was subjected to sequence analysis to
confirm both the insert orientation and the integrity of cloning junctions. The resulting
plasmid was designated pCGN8624.

The plasmid pCGN8621 was constructed by ligating oligonucleotides 5'-
TCGACCTGCAGGAAGCTTGCGGCCGCGGATCCAGCT-3' (SEQ ID NO:47) and 5'-
GGATCCGCGGCCGCAAGCTTCCTGCAGG-3' (SEQ ID NO:48) into SalI/SacI-digested
pCGN7787. A fragment containing the d35S promoter, polylinker and tml 3' region was
removed from pCGN8621 by complete digestion with Asp718I and partial digestion with
NotI. The fragment was blunt-ended by filling in the 5' overhangs with Klenow fragment
then ligated into pCGN5139 that had been digested with Asp718I and HindIII and blunt-
ended by filling in the 5' overhangs with Klenow fragment. A plasmid containing the insert
oriented so that the d35S promoter was closest to the blunted Asp718I site of pCGN5139 and
the tml 3' was closest to the blunted HindIII site was subjected to sequence analysis to
confirm both the insert orientation and the integrity of cloning junctions. The resulting
plasmid was designated pCGN8625.

The plasmid construct pCGN8640 is a modification of pCGN8624 described above.
A 938 bp PstI fragment isolated from transposon Tn7 which encodes bacterial spectinomycin
and streptomycin resistance (Fling et al. (1985), Nucleic Acids Research 13(19):7095-7106), a
determinant for E. coli and Agrobacterium selection, was blunt ended with Pfu polymerase.
The blunt ended fragment was ligated into pCGN8624 that had been digested with SpeI and
blunt ended with Pfu polymerase. The region containing the PstI fragment was sequenced to
confirm both the insert orientation and the integrity of cloning junctions.

The spectinomycin resistance marker was introduced into pCGN8622 and pCGN8623
as follows. A 7.7 Kbp AvrII-SnaBI fragment from pCGN8640 was ligated to a 10.9 Kbp
AvrII-SnaBI fragment from pCGN8623 or pCGN8622, described above. The resulting
plasmids were pCGN8641 and pCGN8643, respectively.

The plasmid pCGN8644 was constructed by ligating oligonucleotides 5'- 
GATCACCTGCAGGAAGCTTGCGGCCGCGGATCCAATGCA-3' (SEQ ID NO:49) and 
5'- TTGGATCCGCGGCCGCAAGCTTCCTGCAGGT-3' (SEQ ID NO:50) into BamHI-PstI 
digested pCGN8640. 

Synthetic oligonucleotides were designed for use in Polymerase Chain Reactions (PCR)
to amplify the coding sequences of ATPT2, ATPT3, ATPT4, ATPT8, and ATPT12 for the
preparation of expression constructs and are provided in Table 2 below.

Table 2:

Name     Restriction Site   Sequence                                             SEQ ID NO:

ATPT2    5' NotI            GGATCCGCGGCCGCACAATGGAGTCTCTGCTCTCTAGTTCT            51
ATPT2    3' SseI            GGATCCTGCAGGTCACTTCAAAAAAGGTAACAGCAAGT               52
ATPT3    5' NotI            GGATCCGCGGCCGCACAATGGCGTTTTTTGGGCTCTCCCGTGTTT        53
ATPT3    3' SseI            GGATCCTGCAGGTTATTGAAAACTTCTTCCAAGTACAACT             54
ATPT4    5' NotI            GGATCCGCGGCCGCACAATGTGGCGAAGATCTGTTGTT               55
ATPT4    3' SseI            GGATCCTGCAGGTCATGGAGAGTAGAAGGAAGGAGCT                56
ATPT8    5' NotI            GGATCCGCGGCCGCACAATGGTACTTGCCGAGGTTCCAAAGCTTGCCTCT   57
ATPT8    3' SseI            GGATCCTGCAGGTCACTTGTTTCTGGTGATGACTCTAT               58
ATPT12   5' NotI            GGATCCGCGGCCGCACAATGACTTCGATTCTCAACACT               59
ATPT12   3' SseI            GGATCCTGCAGGTCAGTGTTGCGATGCTAATGCCGT                 60



The coding sequences of ATPT2, ATPT3, ATPT4, ATPT8, and ATPT12 were all
amplified using the respective PCR primers shown in Table 2 above and cloned into the TopoTA
vector (Invitrogen). Constructs containing the respective prenyltransferase sequences were
digested with NotI and Sse8387I and cloned into the turbobinary vectors described above.
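
As an illustrative check of the primer design (not part of the original disclosure), the short Python sketch below confirms that each 5' primer of Table 2 carries a NotI recognition sequence and each 3' primer carries an Sse8387I recognition sequence. The recognition sequences used (GCGGCCGC for NotI, CCTGCAGG for Sse8387I) are the standard ones; the dictionary layout and the function name `site_report` are assumptions made for this example only.

```python
# Standard recognition sequences for the two enzymes named in the text.
NOTI = "GCGGCCGC"
SSE8387I = "CCTGCAGG"

# Primer sequences as listed in Table 2, keyed by (gene, primer end).
PRIMERS = {
    ("ATPT2", "5'"): "GGATCCGCGGCCGCACAATGGAGTCTCTGCTCTCTAGTTCT",
    ("ATPT2", "3'"): "GGATCCTGCAGGTCACTTCAAAAAAGGTAACAGCAAGT",
    ("ATPT3", "5'"): "GGATCCGCGGCCGCACAATGGCGTTTTTTGGGCTCTCCCGTGTTT",
    ("ATPT3", "3'"): "GGATCCTGCAGGTTATTGAAAACTTCTTCCAAGTACAACT",
    ("ATPT4", "5'"): "GGATCCGCGGCCGCACAATGTGGCGAAGATCTGTTGTT",
    ("ATPT4", "3'"): "GGATCCTGCAGGTCATGGAGAGTAGAAGGAAGGAGCT",
    ("ATPT8", "5'"): "GGATCCGCGGCCGCACAATGGTACTTGCCGAGGTTCCAAAGCTTGCCTCT",
    ("ATPT8", "3'"): "GGATCCTGCAGGTCACTTGTTTCTGGTGATGACTCTAT",
    ("ATPT12", "5'"): "GGATCCGCGGCCGCACAATGACTTCGATTCTCAACACT",
    ("ATPT12", "3'"): "GGATCCTGCAGGTCAGTGTTGCGATGCTAATGCCGT",
}


def site_report(primers):
    """Return the 0-based position of the expected site in each primer (-1 if absent)."""
    report = {}
    for (gene, end), seq in primers.items():
        expected_site = NOTI if end == "5'" else SSE8387I
        report[(gene, end)] = seq.find(expected_site)
    return report


if __name__ == "__main__":
    for key, position in sorted(site_report(PRIMERS).items()):
        print(key, position)
```

A position of -1 for any primer would indicate that the expected cloning site is missing from that oligonucleotide.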

The sequence encoding ATPT2 prenyltransferase was cloned in the sense orientation into
pCGN8640 to produce the plant transformation construct pCGN10800 (Figure 2). The ATPT2
sequence is under control of the 35S promoter.

The ATPT2 sequence was also cloned in the antisense orientation into the construct
pCGN8641 to create pCGN10801 (Figure 3). This construct provides for the antisense
expression of the ATPT2 sequence from the napin promoter.

The ATPT2 coding sequence was also cloned in the antisense orientation into the vector
pCGN8643 to create the plant transformation construct pCGN10802.

The ATPT2 coding sequence was also cloned in the antisense orientation into the vector
pCGN8644 to create the plant transformation construct pCGN10803 (Figure 4).

The ATPT4 coding sequence was cloned into the vector pCGN864 to create the plant
transformation construct pCGN10806 (Figure 5). The ATPT2 coding sequence was cloned into
the vector pCGN864 to create the plant transformation construct pCGN10807 (Figure 6). The
ATPT3 coding sequence was cloned into the vector pCGN864 to create the plant transformation
construct pCGN10808 (Figure 7). The ATPT3 coding sequence was cloned in the sense
orientation into the vector pCGN8640 to create the plant transformation construct pCGN10809
(Figure 8). The ATPT3 coding sequence was cloned in the antisense orientation into the vector
pCGN8641 to create the plant transformation construct pCGN10810 (Figure 9). The ATPT3
coding sequence was cloned into the vector pCGN8643 to create the plant transformation
construct pCGN10811 (Figure 10). The ATPT3 coding sequence was cloned into the vector
pCGN8640 to create the plant transformation construct pCGN10812 (Figure 11). The ATPT4
coding sequence was cloned into the vector pCGN8640 to create the plant transformation
construct pCGN10813 (Figure 12). The ATPT4 coding sequence was cloned into the vector
pCGN8643 to create the plant transformation construct pCGN10814 (Figure 13). The ATPT4
coding sequence was cloned into the vector pCGN8641 to create the plant transformation
construct pCGN10815 (Figure 14). The ATPT4 coding sequence was cloned in the antisense
orientation into the vector pCGN8644 to create the plant transformation construct pCGN10816
(Figure 15). The ATPT2 coding sequence was cloned into the vector pCGN???? to create the
plant transformation construct pCGN10817 (Figure 16). The ATPT8 coding sequence was cloned
in the sense orientation into the vector pCGN8643 to create the plant transformation construct
pCGN10819 (Figure 17). The ATPT12 coding sequence was cloned into the vector pCGN8644
to create the plant transformation construct pCGN10824 (Figure 18). The ATPT12 coding
sequence was cloned into the vector pCGN8641 to create the plant transformation construct
pCGN10825 (Figure 19). The ATPT8 coding sequence was cloned into the vector pCGN8644 to
create the plant transformation construct pCGN10826 (Figure 20).

Example 3: Plant Transformation

Transgenic Brassica plants are obtained by Agrobacterium-mediated transformation
as described by Radke et al. (Theor. Appl. Genet. (1988) 75:685-694; Plant Cell Reports
(1992) 11:499-505). Transgenic Arabidopsis thaliana plants may be obtained by
Agrobacterium-mediated transformation as described by Valvekens et al. (Proc. Nat. Acad.
Sci. (1988) 85:5536-5540), or as described by Bent et al. ((1994), Science 265:1856-1860), or
Bechtold et al. ((1993), C.R. Acad. Sci., Life Sciences 316:1194-1199). Other plant species may
be similarly transformed using related techniques.

Alternatively, microprojectile bombardment methods, such as described by Klein et
al. (Bio/Technology 10:286-291), may also be used to obtain nuclear transformed plants.

Example 4: Identification of Additional Prenyltransferases 

A PSI-Blast profile generated using the E. coli ubiA (GenBank accession 1790473)
sequence was used to analyze the Synechocystis genome. This analysis identified 5 open
reading frames (ORFs) in the Synechocystis genome that were potentially prenyltransferases:
slr0926 (annotated as ubiA, 4-hydroxybenzoate-octaprenyl transferase; SEQ ID NO:32),
sll1899 (annotated as ctaB, cytochrome c oxidase folding protein; SEQ ID NO:33), slr0056
(annotated as g4, chlorophyll synthase 33 kd subunit; SEQ ID NO:34), slr1518 (annotated as
menA, menaquinone biosynthesis protein; SEQ ID NO:35), and slr1736 (annotated as a
hypothetical protein of unknown function; SEQ ID NO:36).

To determine the functionality of these ORFs and their involvement, if any, in the
biosynthesis of tocopherols, knockout constructs were made to disrupt the ORFs identified in
Synechocystis.

Synthetic oligos were designed to amplify regions from the 5' (5'-
TAATGTGTACATTGTCGGCCTC (1736 5') (SEQ ID NO:61) and 5'-
GCAATGTAACATCAGAGATTTTGAGACACAACGTGGCTTTCCACAATTCCCCGCACCGTC
(1736kanpr1) (SEQ ID NO:62)) and 3' (5'-AGGCTAATAAGCACAAATGGGA
(1736 3') (SEQ ID NO:63) and 5'-
GGTATGAGTCAGCAACACCTTCTTCACGAGGCAGACCTCAGCGGAATTGGTTTAGGTTATCCC
(1736kanpr2) (SEQ ID NO:64)) ends of the slr1736 ORF.
The 1736kanpr1 and 1736kanpr2 oligos contained 20 bp of homology to the slr1736 ORF
with an additional 40 bp of sequence homology to the ends of the kanamycin resistance
cassette. Separate PCR steps were completed with these oligos and the products were gel
purified and combined with the kanamycin resistance gene from pUC4K (Pharmacia) that had
been digested with HincII and gel purified away from the vector backbone. The combined
fragments were allowed to assemble without oligos under the following conditions: 94°C for
1 min, 55°C for 1 min, 72°C for 1 min plus 5 seconds per cycle for 40 cycles using Pfu
polymerase in a 100 ul reaction volume (Zhao, H and Arnold (1997) Nucleic Acids Res.
25(6):1307-1308). One microliter or five microliters of this assembly reaction was then
amplified using 5' and 3' oligos nested within the ends of the ORF fragment, so that the
resulting product contained 100-200 bp of the 5' end of the Synechocystis gene to be knocked
out, the kanamycin resistance cassette, and 100-200 bp of the 3' end of the gene to be
knocked out. This PCR product was then cloned into the vector pGemT easy (Promega) to
create the construct pMON21681 and used for Synechocystis transformation.

Primers were also synthesized for the preparation of Synechocystis knockout
constructs for the other sequences using the same method as described above, with the
following primers. The ubiA 5' sequence was amplified using the primers 5'-
GGATCCATGGTTGCCCAAACCCCATC (SEQ ID NO:65) and 5'-
GCAATGTAACATCAGAGATTTTGAGACACAACGTGGCTTTGGGTAAGCAACAATGACCGGC
(SEQ ID NO:66). The 3' region was amplified using the synthetic oligonucleotide primers 5'-
GAATTCTCAAAGCCAGCCCAGTAAC (SEQ ID NO:67) and 5'-
GGTATGAGTCAGCAACACCTTCTTCACGAGGCAGACCTCAGCGGGTGCGAAAAGGGTTTTCCC
(SEQ ID NO:68). The amplification products were combined with the kanamycin resistance
gene from pUC4K (Pharmacia) that had been digested with HincII and gel purified away from
the vector backbone. The annealed fragment was amplified using 5' and 3' oligos nested
within the ends of the ORF fragment (5'-CCAGTGGTTTAGGCTGTGTGGTC (SEQ ID
NO:69) and 5'-CTGAGTTGGATGTATTGGATC (SEQ ID NO:70)), so that the resulting
product contained 100-200 bp of the 5' end of the Synechocystis gene to be knocked out, the
kanamycin resistance cassette, and 100-200 bp of the 3' end of the gene to be knocked out.
This PCR product was then cloned into the vector pGemT easy (Promega) to create the
construct pMON21682 and used for Synechocystis transformation.

Primers were also synthesized for the preparation of Synechocystis knockout
constructs for the other sequences using the same method as described above, with the
following primers. The sll1899 5' sequence was amplified using the primers 5'-
GGATCCATGGTTACTTCGACAAAAATCC (SEQ ID NO:71) and 5'-
GCAATGTAACATCAGAGATTTTGAGACACAACGTGGCTTTGCTAGGCAACCGCTTAGTAC
(SEQ ID NO:72). The 3' region was amplified using the synthetic oligonucleotide primers 5'-
GAATTCTTAACCCAACAGTAAAGTTCCC (SEQ ID NO:73) and 5'-
GGTATGAGTCAGCAACACCTTCTTCACGAGGCAGACCTCAGCGCCGGCATTGTCTTTTACATG
(SEQ ID NO:74). The amplification products were combined with the kanamycin resistance
gene from pUC4K (Pharmacia) that had been digested with HincII and gel purified away from
the vector backbone. The annealed fragment was amplified using 5' and 3' oligos nested within
the ends of the ORF fragment (5'-GGAACCCTTGCAGCCGCTTC (SEQ ID NO:75)
and 5'-GTATGCCCAACTGGTGCAGAGG (SEQ ID NO:76)), so that the resulting product
contained 100-200 bp of the 5' end of the Synechocystis gene to be knocked out, the
kanamycin resistance cassette, and 100-200 bp of the 3' end of the gene to be knocked out.
This PCR product was then cloned into the vector pGemT easy (Promega) to create the
construct pMON21679 and used for Synechocystis transformation.

Primers were also synthesized for the preparation of Synechocystis knockout
constructs for the other sequences using the same method as described above, with the
following primers. The slr0056 5' sequence was amplified using the primers 5'-
GGATCCATGTCTGACACACAAAATACCG (SEQ ID NO:77) and 5'-
GCAATGTAACATCAGAGATTTTGAGACACAACGTGGCTTTCGCCAATACCAGCCACCAACAG
(SEQ ID NO:78). The 3' region was amplified using the synthetic
oligonucleotide primers 5'-GAATTCTCAAATCCCCGCATGGCCTAG (SEQ ID NO:79)
and 5'-
GGTATGAGTCAGCAACACCTTCTTCACGAGGCAGACCTCAGCGGCCTACGGCTTGGACGTGTGGG
(SEQ ID NO:80). The amplification products were combined with the
kanamycin resistance gene from pUC4K (Pharmacia) that had been digested with HincII and
gel purified away from the vector backbone. The annealed fragment was amplified using 5'
and 3' oligos nested within the ends of the ORF fragment (5'-
CACTTGGATTCCCCTGATCTG (SEQ ID NO:81) and 5'-
GCAATACCCGCTTGGAAAACG (SEQ ID NO:82)), so that the resulting product
contained 100-200 bp of the 5' end of the Synechocystis gene to be knocked out, the
kanamycin resistance cassette, and 100-200 bp of the 3' end of the gene to be knocked out.
This PCR product was then cloned into the vector pGemT easy (Promega) to create the
construct pMON21677 and used for Synechocystis transformation.

Primers were also synthesized for the preparation of Synechocystis knockout
constructs for the other sequences using the same method as described above, with the
following primers. The slr1518 5' sequence was amplified using the primers 5'-
GGATCCATGACCGAATCTTCGCCCCTAGC (SEQ ID NO:83) and 5'-
GCAATGTAACATCAGAGATTTTGAGACACAACGTGGCTTTCAATCCTAGGTAGCCGAGGCG
(SEQ ID NO:84). The 3' region was amplified
using the synthetic oligonucleotide primers 5'-GAATTCTTAGCCCAGGCCAGCCCAGCC
(SEQ ID NO:85) and 5'-
GGTATGAGTCAGCAACACCTTCTTCACGAGGCAGACCTCAGCGGGGAATTGATTTGTTTAATTACC
(SEQ ID NO:86). The
amplification products were combined with the kanamycin resistance gene from pUC4K
(Pharmacia) that had been digested with HincII and gel purified away from the vector
backbone. The annealed fragment was amplified using 5' and 3' oligos nested within the
ends of the ORF fragment (5'-GCGATCGCCATTATCGCTTGG (SEQ ID NO:87) and 5'-
GCAGACTGGCAATTATCAGTAACG (SEQ ID NO:88)), so that the resulting product
contained 100-200 bp of the 5' end of the Synechocystis gene to be knocked out, the
kanamycin resistance cassette, and 100-200 bp of the 3' end of the gene to be knocked out.
This PCR product was then cloned into the vector pGemT easy (Promega) to create the
construct pMON21680 and used for Synechocystis transformation.
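
The primer design described above, in which each bridging oligo joins a cassette-homologous tail to an ORF-specific segment, can be illustrated with a short Python sketch (not part of the original disclosure). It takes the five bridging primers used for the 5' regions (SEQ ID NOs: 62, 66, 72, 78 and 84) and computes their longest shared prefix, which corresponds to the portion homologous to the kanamycin resistance cassette; the remainder of each primer is the gene-specific portion. The variable and function names are assumptions made for this example.

```python
import os

# 5'-region bridging primers, as given in the text (kan-cassette tail + ORF-specific part).
FIVE_PRIME_BRIDGING_PRIMERS = {
    "SEQ ID NO:62 (slr1736)": "GCAATGTAACATCAGAGATTTTGAGACACAACGTGGCTTTCCACAATTCCCCGCACCGTC",
    "SEQ ID NO:66 (ubiA)": "GCAATGTAACATCAGAGATTTTGAGACACAACGTGGCTTTGGGTAAGCAACAATGACCGGC",
    "SEQ ID NO:72 (sll1899)": "GCAATGTAACATCAGAGATTTTGAGACACAACGTGGCTTTGCTAGGCAACCGCTTAGTAC",
    "SEQ ID NO:78 (slr0056)": "GCAATGTAACATCAGAGATTTTGAGACACAACGTGGCTTTCGCCAATACCAGCCACCAACAG",
    "SEQ ID NO:84 (slr1518)": "GCAATGTAACATCAGAGATTTTGAGACACAACGTGGCTTTCAATCCTAGGTAGCCGAGGCG",
}


def split_primers(primers):
    """Split each primer into the tail shared by all primers and its specific remainder."""
    # os.path.commonprefix compares strings character by character.
    shared_tail = os.path.commonprefix(list(primers.values()))
    specific = {name: seq[len(shared_tail):] for name, seq in primers.items()}
    return shared_tail, specific


if __name__ == "__main__":
    tail, specific_parts = split_primers(FIVE_PRIME_BRIDGING_PRIMERS)
    print("shared tail:", tail, "length", len(tail))
    for name, part in specific_parts.items():
        print(name, part, "length", len(part))
```

For these sequences the shared tail is 40 nucleotides long, and the slr1736 primer retains a 20-nucleotide gene-specific segment, consistent with the description of the 1736kanpr1 oligo.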

B. Transformation of Synechocystis 

Cells of Synechocystis 6803 were grown to a density of approximately 2x10^8 cells per
ml and harvested by centrifugation. The cell pellet was re-suspended in fresh BG-11 medium
(ATCC Medium 616) at a density of 1x10^9 cells per ml and used immediately for
transformation. One-hundred microliters of these cells were mixed with 5 ul of mini prep
DNA and incubated with light at 30°C for 4 hours. This mixture was then plated onto nylon
filters resting on BG-11 agar supplemented with TES pH 8 and allowed to grow for 12-18
hours. The filters were then transferred to BG-11 agar + TES + 5 ug/ml kanamycin and
allowed to grow until colonies appeared within 7-10 days (Packer and Glazer, 1988).
Colonies were then picked into BG-11 liquid media containing 5 ug/ml kanamycin and
allowed to grow for 5 days. These cells were then transferred to BG-11 media containing
10 ug/ml kanamycin and allowed to grow for 5 days and then transferred to BG-11 +
kanamycin at 25 ug/ml and allowed to grow for 5 days. Cells were then harvested for PCR
analysis to determine the presence of a disrupted ORF and also for HPLC analysis to
determine if the disruption had any effect on tocopherol levels.

PCR analysis of the Synechocystis isolates for slr1736 and sll1899 showed complete
segregation of the mutant genome, meaning no copies of the wild type genome could be
detected in these strains. This suggests that function of the native gene is not essential for cell
function. HPLC analysis of these same isolates showed that the sll1899 strain had no
detectable reduction in tocopherol levels. However, the strain carrying the knockout for
slr1736 produced no detectable levels of tocopherol.

The amino acid sequences for the Synechocystis knockouts are compared using
ClustalW, and are provided in Table 3 below. Provided are the percent identities, percent
similarity, and the percent gap. The alignment of the sequences is provided in Figure 21.



Table 3:

(The numeric entries of this table are not legible in the source. Surviving labels: slr0926,
slr0056, slr1518; measures reported: %identity, %similar, %gap.)





Amino acid sequence comparisons are performed using various Arabidopsis
prenyltransferase sequences and the Synechocystis sequences. The comparisons are presented
in Table 4 below. Provided are the percent identities, percent similarity, and the percent gap.
The alignment of the sequences is provided in Figure 22.



Table 4:

(The layout of this table is not recoverable from the source. Surviving labels: ATPT2,
slr0056, ATPT8, slr1518. Surviving values, in source order: 9, 13, 23, 32, 10, 8, 7, 23, 7.)







4B. Preparation of the slr1737 Knockout

The Synechocystis sp. 6803 slr1737 knockout was constructed by the following
method. The GPS™-1 Genome Priming System (New England Biolabs) was used to insert,
by a Tn7 Transposase system, a kanamycin resistance cassette into slr1737. A plasmid
from a Synechocystis genomic library clone containing 652 base pairs of the targeted orf
(Synechocystis genome base pairs 1324051 - 1324703; the predicted orf base pairs 1323672
- 1324763, as annotated by Cyanobase) was used as target DNA. The reaction was
performed according to the manufacturer's protocol. The reaction mixture was then
transformed into E. coli DH10B electrocompetent cells and plated. Colonies from this
transformation were then screened for transposon insertions into the target sequence by
amplifying with M13 Forward and Reverse Universal primers, yielding a product of 652 base
pairs plus ~1700 base pairs, the size of the transposon kanamycin cassette, for a total
fragment size of ~2300 base pairs. After this determination, it was then necessary to
determine the approximate location of the insertion within the targeted orf, as 100 base pairs
of orf sequence was estimated as necessary for efficient homologous recombination in
Synechocystis. This was accomplished through amplification reactions using either of the
primers to the ends of the transposon, Primer S (5' end) or N (3' end), in combination with
either an M13 Forward or Reverse primer. That is, four different primer combinations were
used to map each potential knockout construct: Primer S - M13 Forward, Primer S - M13
Reverse, Primer N - M13 Forward, Primer N - M13 Reverse. The construct used to
transform Synechocystis and knock out slr1737 was determined to consist of approximately
150 base pairs of slr1737 sequence on the 5' side of the transposon insertion and
approximately 500 base pairs on the 3' side, with the transcription of the orf and kanamycin
cassette in the same direction. The nucleic acid sequence of slr1737 is provided in SEQ ID
NO:38; the deduced amino acid sequence is provided in SEQ ID NO:39.

Cells of Synechocystis 6803 were grown to a density of ~2x10^8 cells per ml and
harvested by centrifugation. The cell pellet was re-suspended in fresh BG-11 medium at a
density of 1x10^9 cells per ml and used immediately for transformation. 100 ul of these cells
were mixed with 5 ul of mini prep DNA and incubated with light at 30°C for 4 hours. This
mixture was then plated onto nylon filters resting on BG-11 agar supplemented with TES pH 8
and allowed to grow for 12-18 hours. The filters were then transferred to BG-11 agar + TES
+ 5 ug/ml kanamycin and allowed to grow until colonies appeared within 7-10 days (Packer
and Glazer, 1988). Colonies were then picked into BG-11 liquid media containing 5 ug/ml
kanamycin and allowed to grow for 5 days. These cells were then transferred to BG-11 media
containing 10 ug/ml kanamycin and allowed to grow for 5 days and then transferred to BG-11
+ kanamycin at 25 ug/ml and allowed to grow for 5 days. Cells were then harvested for PCR
analysis to determine the presence of a disrupted ORF and also for HPLC analysis to
determine if the disruption had any effect on tocopherol levels.

PCR analysis of the Synechocystis isolates, using primers to the ends of the slr1737
orf, showed complete segregation of the mutant genome, meaning no copies of the wild type
genome could be detected in these strains. This suggests that function of the native gene is
not essential for cell function. HPLC analysis showed that the strain carrying the knockout
for slr1737 produced no detectable levels of tocopherol.

4C. Phytyl Prenyltransferase Enzyme Assays 

[3H]Homogentisic acid in 0.1% H3PO4 (specific radioactivity 40 Ci/mmol). Phytyl
pyrophosphate was synthesized as described by Joo, et al (1973) Can J. Biochem. 51:1527.
2-methyl-6-phytylquinol and 2,3-dimethyl-5-phytylquinol were synthesized as described by
Soll, et al (1980) Phytochemistry 19:215. Homogentisic acid, α-, β-, δ-, and γ-tocopherol, and
tocol, were purchased commercially.

The wild-type strain of Synechocystis sp. PCC 6803 was grown in BG11 medium with
bubbling air at 30°C under 50 µE m^-2 s^-1 fluorescent light, and 70% relative humidity. The
growth medium of the slr1736 knock-out (potential PPT) strain of this organism was
supplemented with 25 µg mL^-1 kanamycin. Cells were collected from 0.25 to 1 liter culture by
centrifugation at 5000 g for 10 min and stored at -80°C.

Total membranes were isolated according to Zak's procedures with some modifications
(Zak, et al (1999) Eur J. Biochem 261:311). Cells were broken on a French press. Before the
French press treatment, the cells were incubated for 1 hour with lysozyme (0.5%, w/v) at 30°C
in a medium containing 7 mM EDTA, 5 mM NaCl and 10 mM Hepes-NaOH, pH 7.4. The
spheroplasts were collected by centrifugation at 5000 g for 10 min and resuspended at 0.1 - 0.5
mg chlorophyll mL^-1 in 20 mM potassium phosphate buffer, pH 7.8. A proper amount of
protease inhibitor cocktail and DNase I from Boehringer Mannheim were added to the
solution. French press treatments were performed two to three times at 100 MPa. After
breakage, the cell suspension was centrifuged for 10 min at 5000 g to pellet unbroken cells, and
this was followed by centrifugation at 100,000 g for 1 hour to collect total membranes. The
final pellet was resuspended in a buffer containing 50 mM Tris-HCl and 4 mM MgCl2.

Chloroplast pellets were isolated from 250 g of spinach leaves obtained from local
markets. Deveined leaf sections were cut into grinding buffer (2 L/250 g leaves) containing 2
mM EDTA, 1 mM MgCl2, 1 mM MnCl2, 0.33 M sorbitol, 0.1% ascorbic acid, and 50 mM
Hepes at pH 7.5. The leaves were homogenized for 3 sec three times in a 1-L blender, and
filtered through 4 layers of Miracloth. The supernatant was then centrifuged at 5000 g for 6
min. The chloroplast pellets were resuspended in a small amount of grinding buffer (Douce, et
al Methods in Chloroplast Molecular Biology, 239 (1982)).

Chloroplasts in pellets can be broken in three ways. Chloroplast pellets were first
aliquoted in 1 mg of chlorophyll per tube, centrifuged at 6000 rpm for 2 min in a
microcentrifuge, and grinding buffer was removed. Two hundred microliters of Triton X-100
buffer (0.1% Triton X-100, 50 mM Tris-HCl pH 7.6 and 4 mM MgCl2) or swelling buffer (10
mM Tris pH 7.6 and 4 mM MgCl2) was added to each tube and incubated for 1/2 hour at 4°C.
Then the broken chloroplast pellets were used for the assay immediately. In addition, broken
chloroplasts can also be obtained by freezing in liquid nitrogen and storing at -80°C for 1/2 hour,
then used for the assay.

In some cases chloroplast pellets were further purified with a 40%/80% Percoll gradient to
obtain intact chloroplasts. The intact chloroplasts were broken with swelling buffer, then
either used for assay or further purified for envelope membranes with a 20.5%/31.8% sucrose
density gradient (Soll, et al (1980) supra). The membrane fractions were centrifuged at
100,000 g for 40 min and resuspended in 50 mM Tris-HCl pH 7.6, 4 mM MgCl2.

Various amounts of [3H]HGA, 40 to 60 µM unlabelled HGA with specific activity in the
range of 0.16 to 4 Ci/mmole were mixed with a proper amount of 1 M Tris-NaOH pH 10 to
adjust pH to 7.6. HGA was reduced for 4 min with a trace amount of solid NaBH4. In
addition to HGA, the standard incubation mixture (final vol 1 mL) contained 50 mM Tris-HCl, pH
7.6, 3-5 mM MgCl2, and 100 µM phytyl pyrophosphate. The reaction was initiated by
addition of Synechocystis total membranes, spinach chloroplast pellets, spinach broken
chloroplasts, or spinach envelope membranes. The enzyme reaction was carried out for 2 hours
at 23°C or 30°C in the dark or light. The reaction was stopped by freezing with liquid nitrogen,
and the samples were stored at -80°C or extracted directly.

A constant amount of tocol was added to each assay mixture and reaction products were
extracted with a 2 mL mixture of chloroform/methanol (1:2, v/v) to give a monophasic
solution. NaCl solution (2 mL; 0.9%) was added with vigorous shaking. This extraction
procedure was repeated three times. The organic layer containing the prenylquinones was
filtered through a 20 mµ filter, evaporated under N2, and then resuspended in 100 µL of
ethanol.

The samples were mainly analyzed by a Normal-Phase HPLC method (isocratic 90%
hexane and 10% methyl-t-butyl ether) using a Zorbax silica column, 4.6 x 250 mm. The
samples were also analyzed by a Reversed-Phase HPLC method (isocratic 0.1% H3PO4 in
MeOH) using a Vydac 201HS54 C18 column, 4.6 x 250 mm, coupled with an Alltech C18
guard column. The amounts of products were calculated based on the substrate specific
radioactivity, and adjusted according to the % recovery based on the amount of internal
standard.
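
The quantitation described above can be sketched in Python as follows. This is an illustration only, not the procedure actually used: it assumes that product radioactivity is measured in disintegrations per minute (dpm), that recovery is the fraction of added tocol internal standard found after extraction, and that the correction is a simple division by that fraction. The function names and the example numbers are hypothetical; the only fixed constant is the definition 1 Ci = 2.22 x 10^12 dpm.

```python
DPM_PER_CURIE = 2.22e12   # definition of the curie in disintegrations per minute
PMOL_PER_MMOL = 1e9


def recovery_fraction(tocol_recovered, tocol_added):
    """Fraction of the internal standard (tocol) recovered after extraction."""
    return tocol_recovered / tocol_added


def product_pmol(product_dpm, specific_activity_ci_per_mmol, recovery):
    """Convert product radioactivity to pmol and correct for extraction recovery.

    product_dpm: radioactivity found in the product peak (dpm)
    specific_activity_ci_per_mmol: specific radioactivity of the HGA substrate (Ci/mmol)
    recovery: fraction of internal standard recovered (0 < recovery <= 1)
    """
    dpm_per_pmol = specific_activity_ci_per_mmol * DPM_PER_CURIE / PMOL_PER_MMOL
    uncorrected_pmol = product_dpm / dpm_per_pmol
    return uncorrected_pmol / recovery


if __name__ == "__main__":
    # Hypothetical numbers chosen only to show the arithmetic.
    rec = recovery_fraction(tocol_recovered=4.0, tocol_added=5.0)
    print(product_pmol(product_dpm=8880.0, specific_activity_ci_per_mmol=1.0, recovery=rec))
```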

The amount of chlorophyll was determined as described in Arnon (1949) Plant Physiol
24:1. The amount of protein was determined by the Bradford method using gamma globulin as a
standard (Bradford, (1976) Anal Biochem. 72:248).

Results of the assay demonstrate that 2-Methyl-6-Phytylplastoquinone is produced in
the Synechocystis slr1736 knockout preparations. The results of the phytyl prenyltransferase
enzyme activity assay for the slr1736 knockout are presented in Figure 23.

4D. Complementation of the slr1736 knockout with ATPT2

In order to determine whether ATPT2 could complement the knockout of slr1736 in
Synechocystis 6803, a plasmid was constructed to express the ATPT2 sequence from the TAC
promoter. A vector, plasmid psl1211, was obtained from the lab of Dr. Himadri Pakrasi of
Washington University, and is based on the plasmid RSF1010 which is a broad host range
plasmid (Ng W.-O., Zentella R., Wang, Y., Taylor J-S. A., Pakrasi, H.B. 2000. phrA, the
major photoreactivating factor in the cyanobacterium Synechocystis sp. strain PCC 6803
codes for a cyclobutane pyrimidine dimer specific DNA photolyase. Arch. Microbiol. (in
press)). The ATPT2 gene was isolated from the vector pCGN10817 by PCR using the
following primers: ATPT2nco.pr 5'-CCATGGATTCGAGTAAAGTTGTCGC (SEQ ID
NO:89); ATPT2ri.pr 5'-GAATTCACTTCAAAAAAGGTAACAG (SEQ ID NO:90). These
primers will remove approximately 112 bp from the 5' end of the ATPT2 sequence, which is
thought to be the chloroplast transit peptide. These primers will also add an NcoI site at the
5' end and an EcoRI site at the 3' end which can be used for sub-cloning into subsequent
vectors. The PCR product from using these primers and pCGN10817 was ligated into pGEM
T easy and the resulting vector pMON21689 was confirmed by sequencing using the
M13 forward and M13 reverse primers. The NcoI/EcoRI fragment from pMON21689 was then
ligated with the EagI/EcoRI and EagI/NcoI fragments from psl1211 resulting in
pMON21690. The plasmid pMON21690 was introduced into the slr1736 Synechocystis 6803
KO strain via conjugation. Cells of sl906 (a helper strain) and DH10B cells containing
pMON21690 were grown to log phase (O.D. 600 = 0.4) and 1 ml was harvested by
centrifugation. The cell pellets were washed twice with a sterile BG-11 solution and
resuspended in 200 ul of BG-11. The following was mixed in a sterile eppendorf tube: 50 ul
sl906, 50 ul DH10B cells containing pMON21690, and 100 ul of a fresh culture of the
slr1736 Synechocystis 6803 KO strain (O.D. 730 = 0.2-0.4). The cell mixture was
immediately transferred to a nitrocellulose filter resting on BG-11 and incubated for 24 hours
at 30°C and 2500 lux (50 uE) of light. The filter was then transferred to BG-11 supplemented
with 10 ug/ml gentamycin and incubated as above for ~5 days. When colonies appeared, they
were picked and grown up in liquid BG-11 + gentamycin 10 ug/ml (Elhai, J. and Wolk, P.
1988. Conjugal transfer of DNA to Cyanobacteria. Methods in Enzymology 167, 747-54).
The liquid cultures were then assayed for tocopherols by harvesting 1 ml of culture by
centrifugation, extracting with ethanol/pyrogallol, and HPLC separation. The slr1736
Synechocystis 6803 KO strain did not contain any detectable tocopherols, while the slr1736
Synechocystis 6803 KO strain transformed with pMON21690 contained detectable alpha
tocopherol. A Synechocystis 6803 strain transformed with psl1211 (vector control) produced
alpha tocopherol as well.

Example 5: Transgenic Plant Analysis



Arabidopsis plants transformed with constructs for the sense or antisense expression
of the ATPT proteins were analyzed by High Pressure Liquid Chromatography (HPLC) for
altered levels of total tocopherols, as well as altered levels of specific tocopherols (alpha,
beta, gamma, and delta tocopherol).

Extracts of leaves and seeds were prepared for HPLC as follows. For seed extracts,
10 mg of seed was added to 1 g of microbeads (Biospec) in a sterile microfuge tube to which
500 ul 1% pyrogallol (Sigma Chem)/ethanol was added. The mixture was shaken for 3
minutes in a mini Beadbeater (Biospec) on "fast" speed. The extract was filtered through a
0.2 um filter into an autosampler tube. The filtered extracts were then used in the HPLC analysis
described below.

Leaf extracts were prepared by mixing 30-50 mg of leaf tissue with 1 g microbeads
and freezing in liquid nitrogen until extraction. For extraction, 500 ul 1% pyrogallol in
ethanol was added to the leaf/bead mixture and shaken for 1 minute on a Beadbeater
(Biospec) on "fast" speed. The resulting mixture was centrifuged for 4 minutes at 14,000 rpm
and filtered as described above prior to HPLC analysis.

HPLC was performed on a Zorbax silica HPLC column (4.6 mm x 250 mm) with
fluorescence detection (excitation at 290 nm, emission at 336 nm, and bandpass and slits).
Solvent A was hexane and solvent B was methyl-t-butyl ether. The injection volume was 20
ul, the flow rate was 1.5 ml/min, and the run time was 12 min (40°C) using the gradient (Table 5):

Table 5:

Time       Solvent A    Solvent B
0 min.     90%          10%
10 min.    90%          10%
11 min.    25%          75%
12 min.    90%          10%

Tocopherol standards in 1% pyrogallol/ethanol were also run for comparison (alpha
tocopherol, gamma tocopherol, beta tocopherol, delta tocopherol, and tocopherol (tocol)) (all
from Matreya).

Standard curves for alpha, beta, delta, and gamma tocopherol were calculated using
Chemstation software. The absolute amount of component x is:

Absolute amount of x = Response_x × RF_x × dilution factor

where Response_x is the area of peak x, RF_x is the response factor for component x
(Amount_x / Response_x), and the dilution factor is 500 ul. The ng/mg
tissue is found by: total ng component/mg plant tissue.
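
The quantitation described above can be sketched in a few lines of Python. This is only an illustration and not the Chemstation calculation itself: the function names, the example peak area, and the example response factor are hypothetical, and the sketch assumes the response factor is expressed as ng per ul of extract per unit of peak area, so that multiplying by the 500 ul dilution factor gives nanograms.

```python
def absolute_amount_ng(peak_area, response_factor, dilution_factor_ul=500.0):
    """Absolute amount of x = Response_x * RF_x * dilution factor.

    peak_area       -- area of peak x (Response_x)
    response_factor -- RF_x = Amount_x / Response_x, taken from the standard curve
                       (assumed here to be in ng per ul per unit area)
    """
    return peak_area * response_factor * dilution_factor_ul


def ng_per_mg_tissue(total_ng, tissue_mg):
    """Normalize the total amount of a component to the mass of tissue extracted."""
    if tissue_mg <= 0:
        raise ValueError("tissue mass must be positive")
    return total_ng / tissue_mg


if __name__ == "__main__":
    # Hypothetical values for illustration only, not measured data.
    area = 1200.0        # integrated peak area
    rf = 0.0005          # response factor from a standard curve
    seed_mg = 10.0       # 10 mg of seed, as in the seed extraction above

    total = absolute_amount_ng(area, rf)
    print("total ng:", total)
    print("ng/mg tissue:", ng_per_mg_tissue(total, seed_mg))
```

For leaf extracts the same functions would apply with the weighed leaf mass (30-50 mg) in place of the 10 mg seed mass.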

Results of the HPLC analysis of seed extracts of transgenic Arabidopsis lines
containing pMON10822 for the expression of ATAT2 from the napin promoter are provided
in Figure 24.

HPLC analysis of Arabidopsis seed tissue expressing the ATAT2 sequence
from the napin promoter (pMON10822) demonstrates an increased level of tocopherols in the
seed. Total tocopherol levels are increased by as much as 50 to 60% over the total tocopherol
levels of non-transformed (wild-type) Arabidopsis plants (Figure 24).

Furthermore, levels of particular tocopherols are also increased in transgenic
Arabidopsis plants expressing the ATAT2 nucleic acid sequence from the napin promoter.
Levels of delta tocopherol in these lines are increased greater than 3 fold over the delta
tocopherol levels obtained from the seeds of wild type Arabidopsis lines. Levels of gamma
tocopherol in transgenic Arabidopsis lines expressing the ATAT2 nucleic acid sequence are
increased by as much as about 60% over the levels obtained in the seeds of non-transgenic
control lines. Furthermore, levels of alpha tocopherol are increased by as much as 3 fold over
those obtained from non-transgenic control lines.
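
Because the results above are stated both as percent increases and as fold increases, a short sketch of how the two measures relate may be useful. The function names and example values are hypothetical and are not measurements taken from Figure 24; the sketch assumes only that both measures are computed relative to the non-transgenic control.

```python
def fold_change(transgenic_level, control_level):
    """Ratio of the transgenic level to the control level."""
    if control_level <= 0:
        raise ValueError("control level must be positive")
    return transgenic_level / control_level


def percent_increase(transgenic_level, control_level):
    """Increase over the control, expressed as a percentage of the control."""
    return (fold_change(transgenic_level, control_level) - 1.0) * 100.0


if __name__ == "__main__":
    # Hypothetical ng/mg values for illustration only.
    control = 100.0
    for transgenic in (160.0, 300.0):
        print(transgenic,
              "fold:", fold_change(transgenic, control),
              "percent increase:", percent_increase(transgenic, control))
```

Under this convention a level that is 3 fold that of the control corresponds to a 200% increase, and a 60% increase corresponds to a 1.6 fold level.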

Results of the HPLC analysis of seed extracts of transgenic Arabidopsis lines
containing pMON10803 for the expression of ATAT2 from the enhanced 35S promoter are
provided in Figure 25.

All publications and patent applications mentioned in this specification are indicative 
of the level of skill of those skilled in the art to which this invention pertains. All 
publications and patent applications are herein incorporated by reference to the same extent as 
if each individual publication or patent application was specifically and individually indicated 
to be incorporated by reference. 

Although the foregoing invention has been described in some detail by way of
illustration and example for purposes of clarity of understanding, it will be obvious that
certain changes and modifications may be practiced within the scope of the appended claims.

Claims

What is Claimed is: 

1. An isolated nucleic acid sequence encoding a prenyltransferase.

2. An isolated nucleic acid sequence according to Claim 1, wherein said prenyltransferase is
selected from the group consisting of straight chain prenyltransferase and aromatic prenyltransferase.

3. An isolated DNA sequence according to Claim 1, wherein said nucleic acid sequence is
isolated from a eukaryotic cell source.

4. An isolated DNA sequence according to Claim 3, wherein said eukaryotic cell source is
selected from the group consisting of mammalian, nematode, fungal, and plant cells.

5. The DNA encoding sequence of Claim 4 wherein said prenyltransferase protein is from
Arabidopsis.

6. The DNA encoding sequence of Claim 5 wherein said prenyltransferase protein is encoded
by a sequence selected from the group consisting of the sequences of Figure 1.

7. The DNA encoding sequence of Claim 4 wherein said prenyltransferase protein is from
corn.

8. The DNA encoding sequence of Claim 7 wherein said prenyltransferase protein is encoded
by a sequence which includes the EST of the sequences of Figure 3.

9. The DNA encoding sequence of Claim 4 wherein said prenyltransferase protein is from
soybean.

10. The DNA encoding sequence of Claim 9 wherein said prenyltransferase protein is
encoded by a sequence which includes the ESTs of the group consisting of the sequences of Figure 2
and Figure 9.

11. An isolated DNA sequence according to Claim 1, wherein said nucleic acid sequence is
isolated from a prokaryotic cell source.

12. An isolated DNA sequence according to Claim 11, wherein said prokaryotic source is
Synechocystis.

13. A nucleic acid construct comprising as operably linked components, a transcriptional 
initiation region functional in a host cell, a nucleic acid sequence encoding prenyltransferase, and a 
transcriptional termination region. 

14. A nucleic acid construct according to Claim 13, wherein said nucleic acid sequence
encoding prenyltransferase is obtained from an organism selected from the group consisting of a
eukaryotic organism and a prokaryotic organism.

15. A nucleic acid construct according to Claim 14, wherein said nucleic acid sequence
encoding prenyltransferase is obtained from a plant source.

16. A nucleic acid construct according to Claim 15, wherein said nucleic acid sequence 
encoding prenyltransferase is obtained from a source selected from the group consisting of 
Arabidopsis, soybean and corn. 

17. A nucleic acid construct according to Claim 13, wherein said nucleic acid sequence 
encoding prenyltransferase is obtained from Synechocystis. 

18. A plant cell comprising the construct of Claim 13.

19. A method for the alteration of the tocopherol content in a host cell, comprising:
transforming said host cell with a construct comprising as operably linked components, a 
transcriptional initiation region functional in a host cell, a nucleic acid sequence encoding 
prenyltransferase, and a transcriptional termination region. 

20. The method according to Claim 19, wherein said host cell is selected from the group 
consisting of a prokaryotic cell and a eukaryotic cell. 

21. The method according to Claim 20, wherein said prokaryotic cell is Synechocystis. 

22. The method according to Claim 20, wherein said eukaryotic cell is a plant cell. 

23. The method according to Claim 22, wherein said plant cell is obtained from a plant 
selected from the group consisting of Arabidopsis, soybean, and corn. 

24. A method for producing a tocopherol compound of interest in a host cell, said method
comprising obtaining a transformed host cell, said host cell having and expressing in its genome:

a construct having a DNA sequence encoding a prenyltransferase operably linked to a
transcriptional initiation region functional in a host cell,

wherein said prenyltransferase is involved in the synthesis of tocopherols.

25. The method according to Claim 24, wherein said host cell is selected from the group 
consisting of a prokaryotic cell and a eukaryotic cell. 

26. The method according to Claim 25, wherein said prokaryotic cell is Synechocystis. 

27. The method according to Claim 24, wherein said eukaryotic cell is a plant cell. 

28. The method according to Claim 27, wherein said plant cell is obtained from a plant 
selected from the group consisting of Arabidopsis, soybean, and corn. 

29. A method for increasing the biosynthetic flux in cell from a host cell toward
tocopherol production, said method comprising transforming said host cell with a construct
comprising as operably linked components, a transcriptional initiation region functional in a
host cell, a DNA encoding a prenyltransferase involved in the synthesis of tocopherols, and a
transcriptional termination region.

30. The method according to Claim 29, wherein said host cell is selected from the group
consisting of a prokaryotic cell and a eukaryotic cell.

31. The method according to Claim 30, wherein said prokaryotic cell is Synechocystis.

32. The method according to Claim 30, wherein said eukaryotic cell is a plant cell. 

33. The method according to Claim 32, wherein said plant cell is obtained from a plant
selected from the group consisting of Arabidopsis, soybean, and corn.

Figure 4

Figure 6

Figure 9

Figure 10

Figure 16

Figure 17

SEQUENCE LISTING
<110> Calgene LLC

<120> Nucleic Acid Sequences Involved in
Tocopherol Synthesis

<130> 17133/00/WO

<150> 60/129,899
<151> 1999-04-15

<150> 60/146,461
<151> 1999-07-30

<160> 94

<170> FastSEQ for Windows Version 4.0

<210> 1
<211> 1182
<212> DNA
<213> Arabidopsis sp

<400> 1
atggagtctc tgctctctag ttcttctctt gtttccgctg ctggtgggtt ttgttggaag 60
aagcagaatc taaagctcca ctctttatca gaaatccgag ttctgcgttg tgattcgagt 120
aaagttgtcg caaaaccgaa gtttaggaac aatcttgtta ggcctgatgg tcaaggatct 180
tcattgttgt tgtatccaaa acataagtcg agatttcggg ttaatgccac tgcgggtcag 240
cctgaggctt tcgactcgaa tagcaaacag aagtctttta gagactcgtt agatgcgttt 300
tacaggtttt ctaggcctca tacagttatt ggcacagtgc ttagcatttt atctgtatct 360
ttcttagcag tagagaaggt ttctgatata tctcctttac ttttcactgg catcttggag 420
gctgttgttg cagctctcat gatgaacatt tacatagttg ggctaaatca gttgtctgat 480
gttgaaatag ataaggttaa caagccctat cttccattgg catcaggaga atattctgtt 540
aacaccggca ttgcaatagt agcttccttc tccatcatga gtttctggct tgggtggatt 600
gttggttcat ggccattgtt ctgggctctt tttgtgagtt tcatgctcgg tactgcatac 660
tctatcaatt tgccactttt acggtggaaa agatttgcat tggttgcagc aatgtgtatc 720
ctcgctgtcc gagctattat tgttcaaatc gccttttatc tacatattca gacacatgtg 780
tttggaagac caatcttgtt cactaggcct cttattttcg ccactgcgtt tatgagcttt 840
ttctctgtcg ttattgcatt gtttaaggat atacctgata tcgaagggga taagatattc 900
ggaatccgat cattctctgt aactctgggt cagaaacggg tgttttggac atgtgttaca 960
ctacttcaaa tggcttacgc tgttgcaatt ctagttggag ccacatctcc attcatatgg 1020
agcaaagtca tctcggttgt gggtcatgtt atactcgcaa caactttgtg ggctcgagct 1080
aagtccgttg atctgagtag caaaaccgaa ataacttcat gttatatgtt catatggaag 1140
ctcttttatg cagagtactt gctgttacct tttttgaagt ga 1182

<210> 2
<211> 393
<212> PRT
<213> Arabidopsis sp

<400> 2

Met Glu Ser Leu Leu Ser Ser Ser Ser Leu Val Ser Ala Ala Gly Gly
1               5                   10                  15
Phe Cys Trp Lys Lys Gln Asn Leu Lys Leu His Ser Leu Ser Glu Ile
            20                  25                  30
Arg Val Leu Arg Cys Asp Ser Ser Lys Val Val Ala Lys Pro Lys Phe
        35                  40                  45
Arg Asn Asn Leu Val Arg Pro Asp Gly Gln Gly Ser Ser Leu Leu Leu
    50                  55                  60
Tyr Pro Lys His Lys Ser Arg Phe Arg Val Asn Ala Thr Ala Gly Gln
65                  70                  75                  80
Pro Glu Ala Phe Asp Ser Asn Ser Lys Gln Lys Ser Phe Arg Asp Ser
                85                  90                  95
Leu Asp Ala Phe Tyr Arg Phe Ser Arg Pro His Thr Val Ile Gly Thr
            100                 105                 110
Val Leu Ser Ile Leu Ser Val Ser Phe Leu Ala Val Glu Lys Val Ser
        115                 120                 125
Asp Ile Ser Pro Leu Leu Phe Thr Gly Ile Leu Glu Ala Val Val Ala
    130                 135                 140
Ala Leu Met Met Asn Ile Tyr Ile Val Gly Leu Asn Gln Leu Ser Asp
145                 150                 155                 160
Val Glu Ile Asp Lys Val Asn Lys Pro Tyr Leu Pro Leu Ala Ser Gly
                165                 170                 175
Glu Tyr Ser Val Asn Thr Gly Ile Ala Ile Val Ala Ser Phe Ser Ile
            180                 185                 190
Met Ser Phe Trp Leu Gly Trp Ile Val Gly Ser Trp Pro Leu Phe Trp
        195                 200                 205
Ala Leu Phe Val Ser Phe Met Leu Gly Thr Ala Tyr Ser Ile Asn Leu
    210                 215                 220
Pro Leu Leu Arg Trp Lys Arg Phe Ala Leu Val Ala Ala Met Cys Ile
225                 230                 235                 240
Leu Ala Val Arg Ala Ile Ile Val Gln Ile Ala Phe Tyr Leu His Ile
                245                 250                 255
Gln Thr His Val Phe Gly Arg Pro Ile Leu Phe Thr Arg Pro Leu Ile
            260                 265                 270
Phe Ala Thr Ala Phe Met Ser Phe Phe Ser Val Val Ile Ala Leu Phe
        275                 280                 285
Lys Asp Ile Pro Asp Ile Glu Gly Asp Lys Ile Phe Gly Ile Arg Ser
    290                 295                 300
Phe Ser Val Thr Leu Gly Gln Lys Arg Val Phe Trp Thr Cys Val Thr
305                 310                 315                 320
Leu Leu Gln Met Ala Tyr Ala Val Ala Ile Leu Val Gly Ala Thr Ser
                325                 330                 335
Pro Phe Ile Trp Ser Lys Val Ile Ser Val Val Gly His Val Ile Leu
            340                 345                 350
Ala Thr Thr Leu Trp Ala Arg Ala Lys Ser Val Asp Leu Ser Ser Lys
        355                 360                 365
Thr Glu Ile Thr Ser Cys Tyr Met Phe Ile Trp Lys Leu Phe Tyr Ala
    370                 375                 380
Glu Tyr Leu Leu Leu Pro Phe Leu Lys
385                 390


<210> 3
<211> 1224
<212> DNA
<213> Arabidopsis sp

<400> 3
atggcgtttt ttgggctctc ccgtgtttca agacggttgt tgaaatcttc cgtctccgta 60
actccatctt cttcctctgc tcttttgcaa tcacaacata aatccttgtc caatcctgtg 120
actacccatt acacaaatcc tttcactaag tgttatcctt catggaatga taattaccaa 180
gtatggagta aaggaagaga attgcatcag gagaagtttt ttggtgttgg ttggaattac 240
agattaattt gtggaatgtc gtcgtcttct tcggttttgg agggaaagcc gaagaaagat 300
gataaggaga agagtgatgg tgttgttgtt aagaaagctt cttggataga tttgtattta 360
ccagaagaag ttagaggtta tgctaagctt gctcgattgg ataaacccat tggaacttgg 420
ttgcttgcgt ggccttgtat gtggtcgatt gcgttggctg ctgatcctgg aagccttcca 480
agttttaaat atatggcttt atttggttgc ggagcattac ttcttagagg tgctggttgt 540
actataaatg atctgcttga tcaggacata gatacaaagg ttgatcgtac aaaactaaga 600
cctatcgcca gtggtctttt gacaccattt caagggattg gatttctcgg gctgcagttg 660
cttttaggct tagggattct tctccaactt aacaattaca gccgtgtttt aggggcttca 720
tctttgttac ttgtcttttc ctacccactt atgaagaggt ttacattttg gcctcaagcc 780
tttttaggtt tgaccataaa ctggggagca ttgttaggat ggactgcagt taaaggaagc 840
atagcaccat ctattgtact ccctctctat ctctccggag tctgctggac ccttgtttat 900
gatactattt atgcacatca ggacaaagaa gatgatgtaa aagttggtgt taagtcaaca 960
gcccttagat tcggtgataa tacaaagctt tggttaactg gatttggcac agcatccata 1020
ggttttcttg cactttctgg attcagtgca gatctcgggt ggcaatatta cgcatcactg 1080
gccgctgcat caggacagtt aggatggcaa atagggacag ctgacttatc atctggtgct 1140
gactgcagta gaaaatttgt gtcgaacaag tggtttggtg ctattatatt tagtggagtt 1200
gtacttggaa gaagttttca ataa 1224

<210> 4
<211> 407
<212> PRT
<213> Arabidopsis sp

<400> 4
Met Ala Phe Phe Gly Leu Ser Arg Val Ser Arg Arg Leu Leu Lys Ser
1               5                   10                  15
Ser Val Ser Val Thr Pro Ser Ser Ser Ser Ala Leu Leu Gln Ser Gln
            20                  25                  30
His Lys Ser Leu Ser Asn Pro Val Thr Thr His Tyr Thr Asn Pro Phe
        35                  40                  45
Thr Lys Cys Tyr Pro Ser Trp Asn Asp Asn Tyr Gln Val Trp Ser Lys
    50                  55                  60
Gly Arg Glu Leu His Gln Glu Lys Phe Phe Gly Val Gly Trp Asn Tyr
65                  70                  75                  80
Arg Leu Ile Cys Gly Met Ser Ser Ser Ser Ser Val Leu Glu Gly Lys
                85                  90                  95
Pro Lys Lys Asp Asp Lys Glu Lys Ser Asp Gly Val Val Val Lys Lys
            100                 105                 110
Ala Ser Trp Ile Asp Leu Tyr Leu Pro Glu Glu Val Arg Gly Tyr Ala
        115                 120                 125
Lys Leu Ala Arg Leu Asp Lys Pro Ile Gly Thr Trp Leu Leu Ala Trp
    130                 135                 140
Pro Cys Met Trp Ser Ile Ala Leu Ala Ala Asp Pro Gly Ser Leu Pro
145                 150                 155                 160
Ser Phe Lys Tyr Met Ala Leu Phe Gly Cys Gly Ala Leu Leu Leu Arg
                165                 170                 175
Gly Ala Gly Cys Thr Ile Asn Asp Leu Leu Asp Gln Asp Ile Asp Thr
            180                 185                 190
Lys Val Asp Arg Thr Lys Leu Arg Pro Ile Ala Ser Gly Leu Leu Thr
        195                 200                 205
Pro Phe Gln Gly Ile Gly Phe Leu Gly Leu Gln Leu Leu Leu Gly Leu
    210                 215                 220
Gly Ile Leu Leu Gln Leu Asn Asn Tyr Ser Arg Val Leu Gly Ala Ser
225                 230                 235                 240
Ser Leu Leu Leu Val Phe Ser Tyr Pro Leu Met Lys Arg Phe Thr Phe
                245                 250                 255
Trp Pro Gln Ala Phe Leu Gly Leu Thr Ile Asn Trp Gly Ala Leu Leu
            260                 265                 270
Gly Trp Thr Ala Val Lys Gly Ser Ile Ala Pro Ser Ile Val Leu Pro
        275                 280                 285
Leu Tyr Leu Ser Gly Val Cys Trp Thr Leu Val Tyr Asp Thr Ile Tyr
    290                 295                 300
Ala His Gln Asp Lys Glu Asp Asp Val Lys Val Gly Val Lys Ser Thr
305                 310                 315                 320
Ala Leu Arg Phe Gly Asp Asn Thr Lys Leu Trp Leu Thr Gly Phe Gly
                325                 330                 335
Thr Ala Ser Ile Gly Phe Leu Ala Leu Ser Gly Phe Ser Ala Asp Leu
            340                 345                 350
Gly Trp Gln Tyr Tyr Ala Ser Leu Ala Ala Ala Ser Gly Gln Leu Gly
        355                 360                 365
Trp Gln Ile Gly Thr Ala Asp Leu Ser Ser Gly Ala Asp Cys Ser Arg
    370                 375                 380
Lys Phe Val Ser Asn Lys Trp Phe Gly Ala Ile Ile Phe Ser Gly Val
385                 390                 395                 400
Val Leu Gly Arg Ser Phe Gln
                405



<210> 5
<211> 1296
<212> DNA
<213> Arabidopsis sp





<400> 5 


















aatc tcrttcrt 


ttctcgttta 


tcttcaagaa 


tctctgtttc 


ttcttcgtta 


60 


IS 


rraaaccc ta 


aac tcra ttcc 


ttggtcccgc 


gaattatgtg 


ccgttaatag 


cttctcccag 


120 




\-r \^ i-v^y y 


caacaaaatc 

%j T_n 


aactgctaag 


ttagggatca 


ctggtgttag 


atctgatgcc 


180 




ass f fCraot" tt 


ttaccactgc 


tactgccgcc 


gctacagcta 


cagctaccac 


cggtgagatt 


240 




tcatctacraQ 


ttgcggcttt 


ggctggatta 


gggcatcact 


acgctcgttg 


ttattgggag 


300 




cuLucuaaag 


C tddcic u L.cty 




attcrcaac tt 


ctqqaactgg 


gtatattctg 


360 


ggtacgggaa atgctgcaat tagcttcccg gggctttgtt acacatgtgc aggaaccatg 420
atgattgctg catctgctaa ttccttgaat cagatttttg agataagcaa tgattctaag 480
atgaaaagaa cgatgctaag gccattgcct tcaggacgta ttagtgttcc acacgctgtt 540
gcatgggcta ctattgctgg tgcttctggt gcttgtttgt tggccagcaa gactaatatg 600
ttggctgctg gacttgcatc tgccaatctt gtactttatg cgtttgttta tactccgttg 660
aagcaacttc accctatcaa tacatgggtt ggcgctgttg ttggtgctat cccacccttg 720
cttgggtggg cggcagcgtc tggtcagatt tcatacaatt cgatgattct tccagctgct 780
ctttactttt ggcagatacc tcattttatg gcccttgcac atctctgccg caatgattat 840
gcagctggag gttacaagat gttgtcactc tttgatccgt cagggaagag aatagcagca 900
gtggctctaa ggaactgctt ttacatgatc cctctcggtt tcatcgccta tgactggggg 960
ttaacctcaa gttggttttg cctcgaatca acacttctca cactagcaat cgctgcaaca 1020
gcattttcat tctaccgaga ccggaccatg cataaagcaa ggaaaatgtt ccatgccagt 1080
cttctcttcc ttcctgtttt catgtctggt cttcttctac accgtgtctc taatgataat 1140
cagcaacaac tcgtagaaga agccggatta acaaattctg tatctggtga agtcaaaact 1200
cagaggcgaa agaaacgtgt ggctcaacct ccggtggctt atgcctctgc tgcaccgttt 1260
cctttcctcc cagctccttc cttctactct ccatga 1296



<210> 6
<211> 431
<212> PRT
<213> Arabidopsis sp

<400> 6
Met Trp Arg Arg Ser Val Val Tyr Arg Phe Ser Ser Arg Ile Ser Val
1               5                   10                  15
Ser Ser Ser Leu Pro Asn Pro Arg Leu Ile Pro Trp Ser Arg Glu Leu
            20                  25                  30
Cys Ala Val Asn Ser Phe Ser Gln Pro Pro Val Ser Thr Glu Ser Thr
        35                  40                  45
Ala Lys Leu Gly Ile Thr Gly Val Arg Ser Asp Ala Asn Arg Val Phe
    50                  55                  60
Ala Thr Ala Thr Ala Ala Ala Thr Ala Thr Ala Thr Thr Gly Glu Ile
65                  70                  75                  80
Ser Ser Arg Val Ala Ala Leu Ala Gly Leu Gly His His Tyr Ala Arg
                85                  90                  95
Cys Tyr Trp Glu Leu Ser Lys Ala Lys Leu Ser Met Leu Val Val Ala
            100                 105                 110
Thr Ser Gly Thr Gly Tyr Ile Leu Gly Thr Gly Asn Ala Ala Ile Ser
        115                 120                 125
Phe Pro Gly Leu Cys Tyr Thr Cys Ala Gly Thr Met Met Ile Ala Ala
    130                 135                 140
Ser Ala Asn Ser Leu Asn Gln Ile Phe Glu Ile Ser Asn Asp Ser Lys
145                 150                 155                 160
Met Lys Arg Thr Met Leu Arg Pro Leu Pro Ser Gly Arg Ile Ser Val
                165                 170                 175
Pro His Ala Val Ala Trp Ala Thr Ile Ala Gly Ala Ser Gly Ala Cys
            180                 185                 190
Leu Leu Ala Ser Lys Thr Asn Met Leu Ala Ala Gly Leu Ala Ser Ala
        195                 200                 205
Asn Leu Val Leu Tyr Ala Phe Val Tyr Thr Pro Leu Lys Gln Leu His
    210                 215                 220
Pro Ile Asn Thr Trp Val Gly Ala Val Val Gly Ala Ile Pro Pro Leu
225                 230                 235                 240
Leu Gly Trp Ala Ala Ala Ser Gly Gln Ile Ser Tyr Asn Ser Met Ile
                245                 250                 255
Leu Pro Ala Ala Leu Tyr Phe Trp Gln Ile Pro His Phe Met Ala Leu
            260                 265                 270
Ala His Leu Cys Arg Asn Asp Tyr Ala Ala Gly Gly Tyr Lys Met Leu
        275                 280                 285
Ser Leu Phe Asp Pro Ser Gly Lys Arg Ile Ala Ala Val Ala Leu Arg
    290                 295                 300
Asn Cys Phe Tyr Met Ile Pro Leu Gly Phe Ile Ala Tyr Asp Trp Gly
305                 310                 315                 320
Leu Thr Ser Ser Trp Phe Cys Leu Glu Ser Thr Leu Leu Thr Leu Ala
                325                 330                 335
Ile Ala Ala Thr Ala Phe Ser Phe Tyr Arg Asp Arg Thr Met His Lys
            340                 345                 350
Ala Arg Lys Met Phe His Ala Ser Leu Leu Phe Leu Pro Val Phe Met
        355                 360                 365
Ser Gly Leu Leu Leu His Arg Val Ser Asn Asp Asn Gln Gln Gln Leu
    370                 375                 380
Val Glu Glu Ala Gly Leu Thr Asn Ser Val Ser Gly Glu Val Lys Thr
385                 390                 395                 400
Gln Arg Arg Lys Lys Arg Val Ala Gln Pro Pro Val Ala Tyr Ala Ser
                405                 410                 415
Ala Ala Pro Phe Pro Phe Leu Pro Ala Pro Ser Phe Tyr Ser Pro
            420                 425                 430

<210> 7
<211> 479
<212> DNA
<213> Arabidopsis sp

<400> 7
ggaaactccc ggagcacctg tttgcaggta ccgctaacct taatcgataa tttatttctc 60
ttgtcaggaa ttatgtaagt ctggtggaag gctcgcatac catttttgca ttgcctttcg 120
ctatgatcgg gtttactttg ggtgtgatga gaccaggcgt ggctttatgg tatggcgaaa 180
acccattttt atccaatgct gcattccctc ccgatgattc gttctttcat tcctatacag 240
gtatcatgct gataaaactg ttactggtac tggtttgtat ggtatcagca agaagcgcgg 300
cgatggcgtt taaccggtat ctcgacaggc attttgacgc gaagaacccg cgtactgcca 360
tccgtgaaat acctgcgggc gtcatatctg ccaacagtgc gctggtgttt acgataggct 420
gctgcgtggt attctgggtg gcctgttatt tcattaacac gatctgtttt tacctggcg 479

<210> 8
<211> 551
<212> DNA
<213> Arabidopsis sp

<220>
<221> misc_feature
<222> (1)...(551)
<223> n = A,T,C or G

<400> 8
ttgtggctta caccttaatg agcatacgcc agnccattac ggctcgttaa tcggcgccat 60
ngccggngct gntgcaccgg tagtgggcta ctgcgccgtg accaatcagc ttgatctagc 120
ggctcttatt ctgtttttaa ttttactgtt ctggcaaatg ccgcattttt acgcgatttc 180
cattttcagg ctaaaagact tttcagcggc ctgtattccg gtgctgccca tcattaaaga 240
cctgcgctat accaaaatca gcatgctggt ttacgtgggc ttatttacac tggctgctat 300
catgccggcc ctcttagggt atgccggttg gatttatggg atagcggcct taattttagg 360
cttgtattgg ctttatattg ccatacaagg attcaagacc gccgatgatc aaaaatggtc 420
tcgtaagatg tttggatctt cgattttaat cattaccctc ttgtcggtaa tgatgcttgt 480
ttaaacttac tgcctcctga agtttatata tcgataattt cagcttaagg aggcttagtg 540
gttaattcaa t 551



<210> 9
<211> 297
<212> PRT
<213> Arabidopsis sp

<400> 9
Met Val Leu Ala Glu Val Pro Lys Leu Ala Ser Ala Ala Glu Tyr Phe
1               5                   10                  15
Phe Lys Arg Gly Val Gln Gly Lys Gln Phe Arg Ser Thr Ile Leu Leu
            20                  25                  30
Leu Met Ala Thr Ala Leu Asn Val Arg Val Pro Glu Ala Leu Ile Gly
        35                  40                  45
Glu Ser Thr Asp Ile Val Thr Ser Glu Leu Arg Val Arg Gln Arg Gly
    50                  55                  60
Ile Ala Glu Ile Thr Glu Met Ile His Val Ala Ser Leu Leu His Asp
65                  70                  75                  80
Asp Val Leu Asp Asp Ala Asp Thr Arg Arg Gly Val Gly Ser Leu Asn
                85                  90                  95
Val Val Met Gly Asn Lys Val Val Ala Leu Leu Ala Thr Ala Val Glu
            100                 105                 110
His Leu Val Thr Gly Glu Thr Met Glu Ile Thr Ser Ser Thr Glu Gln
        115                 120                 125
Arg Tyr Ser Met Asp Tyr Tyr Met Gln Lys Thr Tyr Tyr Lys Thr Ala
    130                 135                 140
Ser Leu Ile Ser Asn Ser Cys Lys Ala Val Ala Val Leu Thr Gly Gln
145                 150                 155                 160
Thr Ala Glu Val Ala Val Leu Ala Phe Glu Tyr Gly Arg Asn Leu Gly
                165                 170                 175
Leu Ala Phe Gln Leu Ile Asp Asp Ile Leu Asp Phe Thr Gly Thr Ser
            180                 185                 190
Ala Ser Leu Gly Lys Gly Ser Leu Ser Asp Ile Arg His Gly Val Ile
        195                 200                 205
Thr Ala Pro Ile Leu Phe Ala Met Glu Glu Phe Pro Gln Leu Arg Glu
    210                 215                 220
Val Val Asp Gln Val Glu Lys Asp Pro Arg Asn Val Asp Ile Ala Leu
225                 230                 235                 240
Glu Tyr Leu Gly Lys Ser Lys Gly Ile Gln Arg Ala Arg Glu Leu Ala
                245                 250                 255
Met Glu His Ala Asn Leu Ala Ala Ala Ala Ile Gly Ser Leu Pro Glu
            260                 265                 270
Thr Asp Asn Glu Asp Val Lys Arg Ser Arg Arg Ala Leu Ile Asp Leu
        275                 280                 285
Thr His Arg Val Ile Thr Arg Asn Lys
    290                 295

<210> 10
<211> 561
<212> DNA
<213> Arabidopsis sp

<400> 10
aagcgcatcc gtcctcttct acgattgccg ccagccgcat gtatggctgc ataaccgacc 60
gcccctatcc gctcgcggcc gcggtcgaat tcattcacac cgcgacgctg ctgcatgacg 120
acgtcgtcga tgaaagcgat ttgcgccgcg gccgcgaaag cgcgcataag gttttcggca 180
atcaggcgag cgtgctcgtc ggcgatttcc ttttctcccg cgccttccag ctgatggtgg 240
aagacggctc gctcgacgcg ctgcgcattc tctcggatgc ctccgccgtg atcgcgcagg 300
gcgaagtgat gcagctcggc accgcgcgca atcttgaaac caatatgagc cagtatctcg 360
atgtgatcag cgcgaagacc gccgcgctct ttgccgccgc ctgcgaaatc ggcccggtga 420
tggcgaacgc gaaggcggaa gatgctgccg cgatgtgcga atacggcatg aatctcggta 480
tcgccttcca gatcatcgac gaccttctcg attacggcac cggcggccac gccgagcttg 540
gcaagaacac gggcgacgat t 561

<210> 11
<211> 966
<212> DNA
<213> Arabidopsis sp

<400> 11
atggtacttg ccgaggttcc aaagcttgcc tctgctgctg agtacttctt caaaaggggt 60
gtgcaaggaa aacagtttcg ttcaactatt ttgctgctga tggcgacagc tctgaatgta 120
cgcgttccag aagcattgat tggggaatca acagatatag tcacatcaga attacgcgta 180
aggcaacggg gtattgctga aatcactgaa atgatacacg tcgcaagtct actgcacgat 240
gatgtcttgg atgatgccga tacaaggcgt ggtgttggtt ccttaaatgt tgtaatgggt 300
aacaagatgt cggtattagc aggagacttc ttgctctccc gggcttgtgg ggctctcgct 360
gctttaaaga acacagaggt tgtagcatta cttgcaactg ctgtagaaca tcttgttacc 420
ggtgaaacca tggaaataac tagttcaacc gagcagcgtt atagtatgga ctactacatg 480
cagaagacat attataagac agcatcgcta atctctaaca gctgcaaagc tgttgccgtt 540
ctcactggac aaacagcaga agttgccgtg ttagcttttg agtatgggag gaatctgggt 600
ttagcattcc aattaataga cgacattctt gatttcacgg gcacatctgc ctctctcgga 660
aagggatcgt tgtcagatat tcgccatgga gtcataacag ccccaatcct ctttgccatg 720
gaagagtttc ctcaactacg cgaagttgtt gatcaagttg aaaaagatcc taggaatgtt 780
gacattgctt tagagtatct tgggaagagc aagggaatac agagggcaag agaattagcc 840
atggaacatg cgaatctagc agcagctgca atcgggtctc tacctgaaac agacaatgaa 900
gatgtcaaaa gatcgaggcg ggcacttatt gacttgaccc atagagtcat caccagaaac 960
aagtga 966

<210> 12
<211> 321
<212> PRT
<213> Arabidopsis sp

<400> 12
Met Val Leu Ala Glu Val Pro Lys Leu Ala Ser Ala Ala Glu Tyr Phe
1               5                   10                  15
Phe Lys Arg Gly Val Gln Gly Lys Gln Phe Arg Ser Thr Ile Leu Leu
            20                  25                  30
Leu Met Ala Thr Ala Leu Asn Val Arg Val Pro Glu Ala Leu Ile Gly
        35                  40                  45
Glu Ser Thr Asp Ile Val Thr Ser Glu Leu Arg Val Arg Gln Arg Gly
    50                  55                  60
Ile Ala Glu Ile Thr Glu Met Ile His Val Ala Ser Leu Leu His Asp
65                  70                  75                  80
Asp Val Leu Asp Asp Ala Asp Thr Arg Arg Gly Val Gly Ser Leu Asn
                85                  90                  95
Val Val Met Gly Asn Lys Met Ser Val Leu Ala Gly Asp Phe Leu Leu
            100                 105                 110
Ser Arg Ala Cys Gly Ala Leu Ala Ala Leu Lys Asn Thr Glu Val Val
        115                 120                 125
Ala Leu Leu Ala Thr Ala Val Glu His Leu Val Thr Gly Glu Thr Met
    130                 135                 140
Glu Ile Thr Ser Ser Thr Glu Gln Arg Tyr Ser Met Asp Tyr Tyr Met
145                 150                 155                 160
Gln Lys Thr Tyr Tyr Lys Thr Ala Ser Leu Ile Ser Asn Ser Cys Lys
                165                 170                 175
Ala Val Ala Val Leu Thr Gly Gln Thr Ala Glu Val Ala Val Leu Ala
            180                 185                 190
Phe Glu Tyr Gly Arg Asn Leu Gly Leu Ala Phe Gln Leu Ile Asp Asp
        195                 200                 205
Ile Leu Asp Phe Thr Gly Thr Ser Ala Ser Leu Gly Lys Gly Ser Leu
    210                 215                 220
Ser Asp Ile Arg His Gly Val Ile Thr Ala Pro Ile Leu Phe Ala Met
225                 230                 235                 240
Glu Glu Phe Pro Gln Leu Arg Glu Val Val Asp Gln Val Glu Lys Asp
                245                 250                 255
Pro Arg Asn Val Asp Ile Ala Leu Glu Tyr Leu Gly Lys Ser Lys Gly
            260                 265                 270
Ile Gln Arg Ala Arg Glu Leu Ala Met Glu His Ala Asn Leu Ala Ala
        275                 280                 285
Ala Ala Ile Gly Ser Leu Pro Glu Thr Asp Asn Glu Asp Val Lys Arg
    290                 295                 300
Ser Arg Arg Ala Leu Ile Asp Leu Thr His Arg Val Ile Thr Arg Asn
305                 310                 315                 320
Lys



<210> 13
<211> 621
<212> DNA
<213> Arabidopsis sp

<400> 13
gctttctcct ttgctaattc ttgagctttc ttgatcccac cgcgatttct aactatttca 60
atcgcttctt caagcgatcc aggctcacaa aactcagact caatgatctc tcttagcctt 120
ggctcattct ctagcgcgaa gatcactggc gccgttatgt tacctttggc taagtcatta 180
gctgcaggct tacctaactg ctctgtggac tgagtgaagt ccagaatgtc atcaactact 240
tgaaaagata aaccgagatt cttcccgaac tgatacattt gctctgcgac cttgctttcg 300
actttactga aaattgctgc tcctttggtg cttgcagcta ctaatgaagc tgtcttgtag 360
taactcttta gcatgtagtc atcaagcttg acatcacaat cgaataaact cgatgcttgc 420
tttatctcac cgcttgcaaa atctttgatc acctgcaaaa agataaatca agattcagac 480
caaatgttct ttgtattgag tagcttcatc taatctcaga aaggaatatt acctgactta 540
tgagcttaat gacttcaagg ttttcgagat ttgtaagtac catgatgctt gagcaacatg 600
aaatccccag ctaatacagc t 621

<210> 14
<211> 741
<212> DNA
<213> Arabidopsis sp

<400> 14
ggtgagtttt gttaatagtt atgagattca tctatttttg tcataaaatt gtttggtttg 60
gtttaaactc tgtgtataat tgcaggaaag gaaacagttc atgagctttt cggcacaaga 120
gtagcggtgc tagctggaga tttcatgttt gctcaagcgt catggtactt agcaaatctc 180
gagaatcttg aagttattaa gctcatcagt caggtactta gttactctta cattgttttt 240
ctatgaggtt gagctatgaa tctcatttcg ttgaataatg ctgtgcctca aacttttttt 300
catgttttca ggtgatcaaa gactttgcaa gcggagagat aaagcaggcg tccagcttat 360
ttgactgcga caccaagctc gacgagtact tactcaaaag tttctacaag acagcctctt 420
tagtggctgc gagcaccaaa ggagctgcca ttttcagcag agttgagcct gatgtgacag 480
aacaaatgta cgagtttggg aagaatctcg gtctctcttt ccagatagtt gatgatattt 540
tggatttcac tcagtcgaca gagcagctcg ggaagccagc agggagtgat ttggctaaag 600
gtaacttaac agcacctgtg attttcgctc tggagaggga gccaaggcta agagagatca 660
ttgagtcaaa gttctgtgag gcgggttctc tggaagaagc gattgaagcg gtgacaaaag 720
gtggggggat taagagagca c 741

<210> 15
<211> 1087
<212> DNA
<213> Arabidopsis sp

<400> 15
cctcttcagc caatccagag gaagaagaga caacttttta tctttcgtca agagtctccg 60
aaaacgcacg gttttatgct ctctcttctg ccctcacctc acaagacgca gggcacatga 120
ttcaaccaga gggaaaaagc aacgataaca actctgcttt tgatttcaag ctgtatatga 180
tccgcaaagc cgagtctgta aatgcggctc tcgacgtttc cgtaccgctt ctgaaacccc 240
ttacgatcca agaagcggtc aggtactctt tgctagccgg cggaaaacgt gtgaggcctc 300
tgctctgcat tgccgcttgt gagcttgtgg ggggcgacga ggctactgcc atgtcagccg 360
cttgcgcggt cgagatgatc cacacaagct ctctcattca tgacgatctt ccgtgcatgg 420
acaatgccga cctccgtaga ggcaagccca ccaatcacaa ggtatgttgt ttaattatat 480
gaaggctcag agataatgct gaactagtgt tgaaccaatt tttgctcaaa caaggtatat 540
ggagaagaca tggcggtttt ggcaggtgat gcactccttg cattggcgtt tgagcacatg 600
acggttgtgt cgagtgggtt ggtcgctccc gagaagatga ttcgcgccgt ggttgagctg 660
gccagggcca tagggactac agggctagtt gctggacaaa tgatagacct agccagcgaa 720
agactgaatc cagacaaggt tggattggag catctagagt tcatccatct ccacaaaacg 780
gcggcattgt tggaggcagc ggcagtttta ggggttataa tgggaggtgg aacagaggaa 840
gaaatcgaaa agcttagaaa gtatgctagg tgtattggac tactgtttca ggttgttgat 900
gacattctcg acgtaacaaa atctactgag gaattgggta agacagccgg aaaagacgta 960
atggccggaa agctgacgta tccaaggctg ataggtttgg agggatccag ggaagttgca 1020
gagcacctga ggagagaagc agaggaaaag cttaaagggt ttgatccaag tcaggcggcg 1080
cctctgg 1087



<210> 16
<211> 1164
<212> DNA
<213> Arabidopsis sp

<400> 16
atgacttcga ttctcaacac tgtctccacc atccactctt ccagagttac ctccgtcgat 60
cgagtcggag tcctctctct tcggaattcg gattccgttg agttcactcg ccggcgttct 120
ggtttctcga cgttgatcta cgaatcaccc gggcggagat ttgttgtgcg tgcggcggag 180
actgatactg ataaagttaa atctcagaca cctgacaagg caccagccgg tggttcaagc 240
attaaccagc ttctcggtat caaaggagca tctcaagaaa ctaataaatg gaagattcgt 300
cttcagctta caaaaccagt cacttggcct ccactggttt ggggagtcgt ctgtggtgct 360
gctgcttcag ggaactttca ttggacccca gaggatgttg ctaagtcgat tctttgcatg 420
atgatgtctg gtccttgtct tactggctat acacagacaa tcaacgactg gtatgataga 480
gatatcgacg caattaatga gccatatcgt ccaattccat ctggagcaat atcagagcca 540
gaggttatta cacaagtctg ggtgctatta ttgggaggtc ttggtattgc tggaatatta 600
gatgtgtggg cagggcatac cactcccact gtcttctatc ttgctttggg aggatcattg 660
ctatcttata tatactctgc tccacctctt aagctaaaac aaaatggatg ggttggaaat 720
tttgcacttg gagcaagcta tattagtttg ccatggtggg ctggccaagc attgtttggc 780
actcttacgc cagatgttgt tgttctaaca ctcttgtaca gcatagctgg gttaggaata 840
gccattgtta acgacttcaa aagtgttgaa ggagatagag cattaggact tcagtctctc 900
ccagtagctt ttggcaccga aactgcaaaa tggatatgcg ttggtgctat agacattact 960
cagctttctg ttgccggata tctattagca tctgggaaac cttattatgc gttggcgttg 1020
gttgctttga tcattcctca gattgtgttc cagtttaaat actttctcaa ggaccctgtc 1080
aaatacgacg tcaagtacca ggcaagcgcg cagccattct tggtgctcgg aatatttgta 1140
acggcattag catcgcaaca ctga 1164



<210> 17
<211> 387
<212> PRT
<213> Arabidopsis sp

<400> 17
Met Thr Ser Ile Leu Asn Thr Val Ser Thr Ile His Ser Ser Arg Val
1 5 10 15
Thr Ser Val Asp Arg Val Gly Val Leu Ser Leu Arg Asn Ser Asp Ser
20 25 30
Val Glu Phe Thr Arg Arg Arg Ser Gly Phe Ser Thr Leu Ile Tyr Glu
35 40 45
Ser Pro Gly Arg Arg Phe Val Val Arg Ala Ala Glu Thr Asp Thr Asp
50 55 60
Lys Val Lys Ser Gln Thr Pro Asp Lys Ala Pro Ala Gly Gly Ser Ser
65 70 75 80
Ile Asn Gln Leu Leu Gly Ile Lys Gly Ala Ser Gln Glu Thr Asn Lys
85 90 95
Trp Lys Ile Arg Leu Gln Leu Thr Lys Pro Val Thr Trp Pro Pro Leu
100 105 110
Val Trp Gly Val Val Cys Gly Ala Ala Ala Ser Gly Asn Phe His Trp
115 120 125
Thr Pro Glu Asp Val Ala Lys Ser Ile Leu Cys Met Met Met Ser Gly
130 135 140
Pro Cys Leu Thr Gly Tyr Thr Gln Thr Ile Asn Asp Trp Tyr Asp Arg
145 150 155 160
Asp Ile Asp Ala Ile Asn Glu Pro Tyr Arg Pro Ile Pro Ser Gly Ala
165 170 175
Ile Ser Glu Pro Glu Val Ile Thr Gln Val Trp Val Leu Leu Leu Gly
180 185 190
Gly Leu Gly Ile Ala Gly Ile Leu Asp Val Trp Ala Gly His Thr Thr
195 200 205
Pro Thr Val Phe Tyr Leu Ala Leu Gly Gly Ser Leu Leu Ser Tyr Ile
210 215 220
Tyr Ser Ala Pro Pro Leu Lys Leu Lys Gln Asn Gly Trp Val Gly Asn
225 230 235 240
Phe Ala Leu Gly Ala Ser Tyr Ile Ser Leu Pro Trp Trp Ala Gly Gln
245 250 255
Ala Leu Phe Gly Thr Leu Thr Pro Asp Val Val Val Leu Thr Leu Leu
260 265 270
Tyr Ser Ile Ala Gly Leu Gly Ile Ala Ile Val Asn Asp Phe Lys Ser
275 280 285
Val Glu Gly Asp Arg Ala Leu Gly Leu Gln Ser Leu Pro Val Ala Phe
290 295 300
Gly Thr Glu Thr Ala Lys Trp Ile Cys Val Gly Ala Ile Asp Ile Thr
305 310 315 320
Gln Leu Ser Val Ala Gly Tyr Leu Leu Ala Ser Gly Lys Pro Tyr Tyr
325 330 335
Ala Leu Ala Leu Val Ala Leu Ile Ile Pro Gln Ile Val Phe Gln Phe
340 345 350
Lys Tyr Phe Leu Lys Asp Pro Val Lys Tyr Asp Val Lys Tyr Gln Ala
355 360 365
Ser Ala Gln Pro Phe Leu Val Leu Gly Ile Phe Val Thr Ala Leu Ala
370 375 380
Ser Gln His
385

<210> 18
<211> 981
<212> DNA
<213> Arabidopsis sp

<400> 18
atgttgttta gtggttcagc gatcccatta agcagcttct gctctcttcc ggagaaaccc 60
cacactcttc ctatgaaact ctctcccgct gcaatccgat cttcatcctc atctgccccg 120
gggtcgttga acttcgatct gaggacgtat tggacgactc tgatcaccga gatcaaccag 180
aagctggatg aggccatacc ggtcaagcac cctgcgggga tctacgaggc tatgagatac 240
tctgtactcg cacaaggcgc caagcgtgcc cctcctgtga tgtgtgtggc ggcctgcgag 300
ctcttcggtg gcgatcgcct cgccgctttc cccaccgcct gtgccctaga aatggtgcac 360
gcggcttcgt tgatacacga cgacctcccc tgtatggacg acgatcctgt gcgcagagga 420
aagccatcta accacactgt ctacggctct ggcatggcca ttctcgccgg tgacgccctc 480
ttcccactcg ccttccagca cattgtctcc cacacgcctc ctgaccttgt tccccgagcc 540
accatcctca gactcatcac tgagattgcc cgcactgtcg gctccactgg tatggctgca 600
ggccagtacg tcgaccttga aggaggtccc tttcctcttt cctttgttca ggagaagaaa 660
ttcggagcca tgggtgaatg ctctgccgtg tgcggtggcc tattgggcgg tgccactgag 720
gatgagctcc agagtctccg aaggtacggg agagccgtcg ggatgctgta tcaggtggtc 780
gatgacatca ccgaggacaa gaagaagagc tatgatggtg gagcagagaa gggaatgatg 840
gaaatggcgg aagagctcaa ggagaaggcg aagaaggagc ttcaagtgtt tgacaacaag 900
tatggaggag gagacacact tgttcctctc tacaccttcg ttgactacgc tgctcatcga 960
cattttcttc ttcccctctg a 981



<210> 19
<211> 245
<212> DNA
<213> Glycine sp

<400> 19
gcaacatctg ggactgggtt tgtcttgggg agtggtagtg ctgttgatct ttcggcactt 60
tcttgcactt gcttgggtac catgatggtt gctgcatctg ctaactcttt gaatcaggtg 120
tttgagatca ataatgatgc taaaatgaag agaacaagtc gcaggccact accctcagga 180
cgcatcacaa tacctcatgc agttggctgg gcatcctctg ttggattagc tggtacggct 240
ctact 245

<210> 20
<211> 253
<212> DNA
<213> Glycine sp

<400> 20
attggctttc caagatcatt gggttttctt gttgcattca tgaccttcta ctccttgggt 60
ttggcattgt ccaaggatat acctgacgtt gaaggagata aagagcacgg cattgattct 120
tttgcagtac gtctaggtca gaaacgggca ttttggattt gcgtttcctt ttttgaaatg 180
gctttcggag ttggtatcct ggccggagca tcatgctcac acttttggac taaaattttc 240
acgggtatgg gaa 253

<210> 21
<211> 275
<212> DNA
<213> Glycine sp

<400> 21
tgatcttcta ctctctgggt atggcattgt ccaaggatat atctgacgtt aaaggagata 60
aagcatacgg catcgatact ttagcgatac gtttgggtca aaaatgggta ttttggattt 120
gcattatcct ttttgaaatg gcttttggag ttgccctctt ggcaggagca acatcttctt 180
acctttggat taaaattgtc acgggtctgg gacatgctat tcttgcttca attctcttgt 240
accaagccaa atctatatac ttgagcaaca aagtt 275

<210> 22
<211> 299
<212> DNA
<213> Glycine sp

<220>
<221> misc_feature
<222> (1)...(299)
<223> n = A,T,C or G

<400> 22
ccanaatang tncatcttng aaagacaatt ggcctcttca acacacaagt ctgcatgtga
agaagaggcc aattgtcttt ccaagatcac ttatngtggc tattgtaatc atgaacttct
tctttgtggg tatggcattg gcaaaggata tacctanctg ttgaaggaga taaaatatat
ggcattgata cttttgcaat acgtataggt caaaaacaag tattttggat ttgtattttc
ctttttgaaa ggctttcgga gtttccctag tggcaggagc aacatcttct agccttggt



<210> 23
<211> 767
<212> DNA
<213> Glycine sp

<400> 23
gtggaggctg tggttgctgc cctgtttatg aatatttata ttgttggttt gaatcaattg 60
tctgatgttg aaatagacaa gataaacaag ccgtatcttc cattagcatc tggggaatat 120
tcctttgaaa ctggtgtcac tattgttgca tctttttcaa ttctgagttt ttggcttggc 180
tgggttgtag gttcatggcc attattttgg gccctttttg taagctttgt gctaggaact 240
gcttattcaa tcaatgtgcc tctgttgaga tggaagaggt ttgcagtgct tgcagcgatg 300
tgcattctag ctgttcgggc agtaatagtt caacttgcat ttttccttca catgcagact 360
catgtgtaca agaggccacc tgtcttttca agaccattga tttttgctac tgcattcatg 420
agcttcttct ctgtagttat agcactgttt aaggatatac ctgacattga aggagataaa 480
gtatttggca tccaatcttt ttcagtgtgt ttaggtcaga agccggtgtt ctggacttgt 540
gttacccttc ttgaaatagc ttatggagtc gccctcctgg tgggagctgc atctccttgt 600
ctttggagca aaattttcac gggtctggga cacgctgtgc tggcttcaat tctctggttt 660
catgccaaat ctgtagattt gaaaagcaaa gcttcgataa catccttcta tatgtttatt 720
tggaagctat tttatgcaga atacttactc attccttttg ttagatg 767



<210> 24
<211> 255
<212> PRT
<213> Glycine sp

<400> 24
Val Glu Ala Val Val Ala Ala Leu Phe Met Asn Ile Tyr Ile Val Gly
1 5 10 15
Leu Asn Gln Leu Ser Asp Val Glu Ile Asp Lys Ile Asn Lys Pro Tyr
20 25 30
Leu Pro Leu Ala Ser Gly Glu Tyr Ser Phe Glu Thr Gly Val Thr Ile
35 40 45
Val Ala Ser Phe Ser Ile Leu Ser Phe Trp Leu Gly Trp Val Val Gly
50 55 60
Ser Trp Pro Leu Phe Trp Ala Leu Phe Val Ser Phe Val Leu Gly Thr
65 70 75 80
Ala Tyr Ser Ile Asn Val Pro Leu Leu Arg Trp Lys Arg Phe Ala Val
85 90 95
Leu Ala Ala Met Cys Ile Leu Ala Val Arg Ala Val Ile Val Gln Leu
100 105 110
Ala Phe Phe Leu His Met Gln Thr His Val Tyr Lys Arg Pro Pro Val
115 120 125
Phe Ser Arg Pro Leu Ile Phe Ala Thr Ala Phe Met Ser Phe Phe Ser
130 135 140
Val Val Ile Ala Leu Phe Lys Asp Ile Pro Asp Ile Glu Gly Asp Lys
145 150 155 160
Val Phe Gly Ile Gln Ser Phe Ser Val Cys Leu Gly Gln Lys Pro Val
165 170 175
Phe Trp Thr Cys Val Thr Leu Leu Glu Ile Ala Tyr Gly Val Ala Leu
180 185 190
Leu Val Gly Ala Ala Ser Pro Cys Leu Trp Ser Lys Ile Phe Thr Gly
195 200 205
Leu Gly His Ala Val Leu Ala Ser Ile Leu Trp Phe His Ala Lys Ser
210 215 220
Val Asp Leu Lys Ser Lys Ala Ser Ile Thr Ser Phe Tyr Met Phe Ile
225 230 235 240
Trp Lys Leu Phe Tyr Ala Glu Tyr Leu Leu Ile Pro Phe Val Arg
245 250 255

<210> 25
<211> 360
<212> DNA
<213> Zea sp

<220>
<221> misc_feature
<222> (1)...(360)
<223> n = A,T,C or G

<400> 25
ggcgtcttca cttgttctgg tcttctcgta tcccctgatg aagaggttca cattttggcc 60
tcaggcttat cttggcctga cattcaactg gggagcttta ctagggtggg ctgctattaa 120
ggaaagcata gaccctgcaa atcatccttc cattgtatac agctggtatt tgttggacgc 180
tggtgtatga tactatatat gcgcatcagg tgtttcgcta tccctacttt catattaatc 240
cttgatgaag tggccatttc atgttgtcgc ggtggtctta tacttgcata tctccatgca 300
tctcaggaca aagangatga cctgaaagta ggagtccaag tccacagctt aagatttggg 360



<210> 26
<211> 299
<212> DNA
<213> Zea sp

<220>
<221> misc_feature
<222> (1)...(299)
<223> n = A,T,C or G

<400> 26
gatggttgca gcatctgcaa ataccctcaa ccaggtgttt gngataaaaa atgatgctaa 60
aatgaaaagg acaatgcgtg ccccctgcca tctggtcgca ttagtcctgc acatgctgcg 120
atgtgggcta caagtgttgg agttgcagga acagctttgt tggcctggaa ggctaatggc 180
ttggcagctg ggcttgcagc ttctaatctt gttctgtatg catttgtgta tacgccgttg 240
aagcaaatac accctgttaa tacatgggtt ggggcagtcg ttggtgccat cccaccact 299



<210> 27
<211> 255
<212> DNA
<213> Zea sp

<220>
<221> misc_feature
<222> (1)...(255)
<223> n = A,T,C or G

<400> 27
anacttgcat atctccatgc ntctcaggac aaagangatg acctgaaagt aggtgtcaag 60
tccacagcat taagatttgg agatttgacc nnatactgna tcagtggctt tggcgcggca 120
tgcttcggca gcttagcact cagtggttac aatgctgacc ttggttggtg tttagtgtga 180
tgcttgagcg aagaatggta tngtttttac ttgatattga ctccagacct gaaatcatgt 240
tggacagggt ggccc 255

<210> 28
<211> 257
<212> DNA
<213> Zea sp

<400> 28
attgaagggg ataggactct ggggcttcag tcacttcctg ttgcttttgg gatggaaact 60
gcaaaatgga tttgtgttgg agcaattgat atcactcaat tatctgttgc aggttaccta 120
ttgagcaccg gtaagctgta ttatgccctg gtgttgcttg ggctaacaat tcctcaggtg 180
ttctttcagt tccagtactt cctgaaggac cctgtgaagt atgatgtcaa atatcaggca 240
agcgcacaac cattctt 257



<210> 29
<211> 368
<212> DNA
<213> Zea sp

<400> 29
atccagttgc aaataataat ggcgttcttc tctgttgtaa tagcactatt caaggatata 60
cctgacatcg aaggggaccg catattcggg atccgatcct tcagcgtccg gttagggcaa 120
aagaaggtct tttggatctg cgttggcttg cttgagatgg cctacagcgt tgcgatactg 180
atgggagcta cctcttcctg tttgtggagc aaaacagcaa ccatcgctgg ccattccata 240
cttgccgcga tcctatggag ctgcgcgcga tcggtggact tgacgagcaa agccgcaata 300
acgtccttct acatgttcat ctggaagctg ttctacgcgg agtacctgct catccctctg 360
gtgcggtg 368



<210> 30
<211> 122
<212> PRT
<213> Zea sp

<400> 30
Ile Gln Leu Gln Ile Ile Met Ala Phe Phe Ser Val Val Ile Ala Leu
1 5 10 15
Phe Lys Asp Ile Pro Asp Ile Glu Gly Asp Arg Ile Phe Gly Ile Arg
20 25 30
Ser Phe Ser Val Arg Leu Gly Gln Lys Lys Val Phe Trp Ile Cys Val
35 40 45
Gly Leu Leu Glu Met Ala Tyr Ser Val Ala Ile Leu Met Gly Ala Thr
50 55 60
Ser Ser Cys Leu Trp Ser Lys Thr Ala Thr Ile Ala Gly His Ser Ile
65 70 75 80
Leu Ala Ala Ile Leu Trp Ser Cys Ala Arg Ser Val Asp Leu Thr Ser
85 90 95
Lys Ala Ala Ile Thr Ser Phe Tyr Met Phe Ile Trp Lys Leu Phe Tyr
100 105 110
Ala Glu Tyr Leu Leu Ile Pro Leu Val Arg
115 120

<210> 31
<211> 278
<212> DNA
<213> Zea sp

<400> 31
tattcagcac cacctctcaa gctcaagcag aatggatgga ttgggaactt cgctctgggt 60
gcgagttaca tcagcttgcc ctggtgggct ggccaggcgt tatttggaac tcttacacca 120
gatatcattg tcttgactac tttgtacagc atagctgggc tagggattgc tattgtaaat 180
gatttcaaga gtattgaagg ggataggact ctggggcttc agtcacttcc tgttgctttt 240
gggatggaaa ctgcaaaatg gatttgtgtt ggagcaat 278



<210> 32
<211> 292
<212> PRT
<213> Synechocystis sp

<400> 32
Met Val Ala Gln Thr Pro Ser Ser Pro Pro Leu Trp Leu Thr Ile Ile
1 5 10 15
Tyr Leu Leu Arg Trp His Lys Pro Ala Gly Arg Leu Ile Leu Met Ile
20 25 30
Pro Ala Leu Trp Ala Val Cys Leu Ala Ala Gln Gly Leu Pro Pro Leu
35 40 45
Pro Leu Leu Gly Thr Ile Ala Leu Gly Thr Leu Ala Thr Ser Gly Leu
50 55 60
Gly Cys Val Val Asn Asp Leu Trp Asp Arg Asp Ile Asp Pro Gln Val
65 70 75 80
Glu Arg Thr Lys Gln Arg Pro Leu Ala Ala Arg Ala Leu Ser Val Gln
85 90 95
Val Gly Ile Gly Val Ala Leu Val Ala Leu Leu Cys Ala Ala Gly Leu
100 105 110
Ala Phe Tyr Leu Thr Pro Leu Ser Phe Trp Leu Cys Val Ala Ala Val
115 120 125
Pro Val Ile Val Ala Tyr Pro Gly Ala Lys Arg Val Phe Pro Val Pro
130 135 140
Gln Leu Val Leu Ser Ile Ala Trp Gly Phe Ala Val Leu Ile Ser Trp
145 150 155 160
Ser Ala Val Thr Gly Asp Leu Thr Asp Ala Thr Trp Val Leu Trp Gly
165 170 175
Ala Thr Val Phe Trp Thr Leu Gly Phe Asp Thr Val Tyr Ala Met Ala
180 185 190
Asp Arg Glu Asp Asp Arg Arg Ile Gly Val Asn Ser Ser Ala Leu Phe
195 200 205
Phe Gly Gln Tyr Val Gly Glu Ala Val Gly Ile Phe Phe Ala Leu Thr
210 215 220
Ile Gly Cys Leu Phe Tyr Leu Gly Met Ile Leu Met Leu Asn Pro Leu
225 230 235 240
Tyr Trp Leu Ser Leu Ala Ile Ala Ile Val Gly Trp Val Ile Gln Tyr
245 250 255
Ile Gln Leu Ser Ala Pro Thr Pro Glu Pro Lys Leu Tyr Gly Gln Ile
260 265 270
Phe Gly Gln Asn Val Ile Ile Gly Phe Val Leu Leu Ala Gly Met Leu
275 280 285
Leu Gly Trp Leu
290



<210> 33
<211> 316
<212> PRT
<213> Synechocystis sp

<400> 33
Met Val Thr Ser Thr Lys Ile His Arg Gln His Asp Ser Met Gly Ala
1 5 10 15
Val Cys Lys Ser Tyr Tyr Gln Leu Thr Lys Pro Arg Ile Ile Pro Leu
20 25 30
Leu Leu Ile Thr Thr Ala Ala Ser Met Trp Ile Ala Ser Glu Gly Arg
35 40 45
Val Asp Leu Pro Lys Leu Leu Ile Thr Leu Leu Gly Gly Thr Leu Ala
50 55 60
Ala Ala Ser Ala Gln Thr Leu Asn Cys Ile Tyr Asp Gln Asp Ile Asp
65 70 75 80
Tyr Glu Met Leu Arg Thr Arg Ala Arg Pro Ile Pro Ala Gly Lys Val
85 90 95
Gln Pro Arg His Ala Leu Ile Phe Ala Leu Ala Leu Gly Val Leu Ser
100 105 110
Phe Ala Leu Leu Ala Thr Phe Val Asn Val Leu Ser Gly Cys Leu Ala
115 120 125
Leu Ser Gly Ile Val Phe Tyr Met Leu Val Tyr Thr His Trp Leu Lys
130 135 140
Arg His Thr Ala Gln Asn Ile Val Ile Gly Gly Ala Ala Gly Ser Ile
145 150 155 160
Pro Pro Leu Val Gly Trp Ala Ala Val Thr Gly Asp Leu Ser Trp Thr
165 170 175
Pro Trp Val Leu Phe Ala Leu Ile Phe Leu Trp Thr Pro Pro His Phe
180 185 190
Trp Ala Leu Ala Leu Met Ile Lys Asp Asp Tyr Ala Gln Val Asn Val
195 200 205
Pro Met Leu Pro Val Ile Ala Gly Glu Glu Lys Thr Val Ser Gln Ile
210 215 220
Trp Tyr Tyr Ser Leu Leu Val Val Pro Phe Ser Leu Leu Leu Val Tyr
225 230 235 240
Pro Leu His Gln Leu Gly Ile Leu Tyr Leu Ala Ile Ala Ile Ile Leu
245 250 255
Gly Gly Gln Phe Leu Val Lys Ala Trp Gln Leu Lys Gln Ala Pro Gly
260 265 270
Asp Arg Asp Leu Ala Arg Gly Leu Phe Lys Phe Ser Ile Phe Tyr Leu
275 280 285
Met Leu Leu Cys Leu Ala Met Val Ile Asp Ser Leu Pro Val Thr His
290 295 300
Gln Leu Val Ala Gln Met Gly Thr Leu Leu Leu Gly
305 310 315



<210> 34
<211> 324
<212> PRT
<213> Synechocystis sp

<400> 34
Met Ser Asp Thr Gln Asn Thr Gly Gln Asn Gln Ala Lys Ala Arg Gln
1 5 10 15
Leu Leu Gly Met Lys Gly Ala Ala Pro Gly Glu Ser Ser Ile Trp Lys
20 25 30
Ile Arg Leu Gln Leu Met Lys Pro Ile Thr Trp Ile Pro Leu Ile Trp
35 40 45
Gly Val Val Cys Gly Ala Ala Ser Ser Gly Gly Tyr Ile Trp Ser Val
50 55 60
Glu Asp Phe Leu Lys Ala Leu Thr Cys Met Leu Leu Ser Gly Pro Leu
65 70 75 80
Met Thr Gly Tyr Thr Gln Thr Leu Asn Asp Phe Tyr Asp Arg Asp Ile
85 90 95
Asp Ala Ile Asn Glu Pro Tyr Arg Pro Ile Pro Ser Gly Ala Ile Ser
100 105 110
Val Pro Gln Val Val Thr Gln Ile Leu Ile Leu Leu Val Ala Gly Ile
115 120 125
Gly Val Ala Tyr Gly Leu Asp Val Trp Ala Gln His Asp Phe Pro Ile
130 135 140
Met Met Val Leu Thr Leu Gly Gly Ala Phe Val Ala Tyr Ile Tyr Ser
145 150 155 160
Ala Pro Pro Leu Lys Leu Lys Gln Asn Gly Trp Leu Gly Asn Tyr Ala
165 170 175
Leu Gly Ala Ser Tyr Ile Ala Leu Pro Trp Trp Ala Gly His Ala Leu
180 185 190
Phe Gly Thr Leu Asn Pro Thr Ile Met Val Leu Thr Leu Ile Tyr Ser
195 200 205
Leu Ala Gly Leu Gly Ile Ala Val Val Asn Asp Phe Lys Ser Val Glu
210 215 220
Gly Asp Arg Gln Leu Gly Leu Lys Ser Leu Pro Val Met Phe Gly Ile
225 230 235 240
Gly Thr Ala Ala Trp Ile Cys Val Ile Met Ile Asp Val Phe Gln Ala
245 250 255
Gly Ile Ala Gly Tyr Leu Ile Tyr Val His Gln Gln Leu Tyr Ala Thr
260 265 270
Ile Val Leu Leu Leu Leu Ile Pro Gln Ile Thr Phe Gln Asp Met Tyr
275 280 285
Phe Leu Arg Asn Pro Leu Glu Asn Asp Val Lys Tyr Gln Ala Ser Ala
290 295 300
Gln Pro Phe Leu Val Phe Gly Met Leu Ala Thr Gly Leu Ala Leu Gly
305 310 315 320
His Ala Gly Ile



<210> 35
<211> 307
<212> PRT
<213> Synechocystis sp

<400> 35
Met Thr Glu Ser Ser Pro Leu Ala Pro Ser Thr Ala Pro Ala Thr Arg
1 5 10 15
Lys Leu Trp Leu Ala Ala Ile Lys Pro Pro Met Tyr Thr Val Ala Val
20 25 30
Val Pro Ile Thr Val Gly Ser Ala Val Ala Tyr Gly Leu Thr Gly Gln
35 40 45
Trp His Gly Asp Val Phe Thr Ile Phe Leu Leu Ser Ala Ile Ala Ile
50 55 60
Ile Ala Trp Ile Asn Leu Ser Asn Asp Val Phe Asp Ser Asp Thr Gly
65 70 75 80
Ile Asp Val Arg Lys Ala His Ser Val Val Asn Leu Thr Gly Asn Arg
85 90 95
Asn Leu Val Phe Leu Ile Ser Asn Phe Phe Leu Leu Ala Gly Val Leu
100 105 110
Gly Leu Met Ser Met Ser Trp Arg Ala Gln Asp Trp Thr Val Leu Glu
115 120 125
Leu Ile Gly Val Ala Ile Phe Leu Gly Tyr Thr Tyr Gln Gly Pro Pro
130 135 140
Phe Arg Leu Gly Tyr Leu Gly Leu Gly Glu Leu Ile Cys Leu Ile Thr
145 150 155 160
Phe Gly Pro Leu Ala Ile Ala Ala Ala Tyr Tyr Ser Gln Ser Gln Ser
165 170 175
Phe Ser Trp Asn Leu Leu Thr Pro Ser Val Phe Val Gly Ile Ser Thr
180 185 190
Ala Ile Ile Leu Phe Cys Ser His Phe His Gln Val Glu Asp Asp Leu
195 200 205
Ala Ala Gly Lys Lys Ser Pro Ile Val Arg Leu Gly Thr Lys Leu Gly
210 215 220
Ser Gln Val Leu Thr Leu Ser Val Val Ser Leu Tyr Leu Ile Thr Ala
225 230 235 240
Ile Gly Val Leu Cys His Gln Ala Pro Trp Gln Thr Leu Leu Ile Ile
245 250 255
Ala Ser Leu Pro Trp Ala Val Gln Leu Ile Arg His Val Gly Gln Tyr
260 265 270
His Asp Gln Pro Glu Gln Val Ser Asn Cys Lys Phe Ile Ala Val Asn
275 280 285
Leu His Phe Phe Ser Gly Met Leu Met Ala Ala Gly Tyr Gly Trp Ala
290 295 300
Gly Leu Gly
305



<210> 36
<211> 927
<212> DNA
<213> Synechocystis sp

<400> 36
atggcaacta tccaagcttt ttggcgcttc tcccgccccc ataccatcat tggtacaact 60
ctgagcgtct gggctgtgta tctgttaact attctcgggg atggaaactc agttaactcc 120
cctgcttccc tggatttagt gttcggcgct tggctggcct gcctgttggg taatgtgtac 180
attgtcggcc tcaaccaatt gtgggatgtg gacattgacc gcatcaataa gccgaatttg 240
cccctagcta acggagattt ttctatcgcc cagggccgtt ggattgtggg actttgtggc 300
gttgcttcct tggcgatcgc ctggggatta gggctatggc tggggctaac ggtgggcatt 360
agtttgatta ttggcacggc ctattcggtg ccgccagtga ggttaaagcg cttttccctg 420
ctggcggccc tgtgtattct gacggtgcgg ggaattgtgg ttaacttggg cttattttta 480
ttttttagaa ttggtttagg ttatcccccc actttaataa cccccatctg ggttttgact 540
ttatttatct tagttttcac cgtggcgatc gccattttta aagatgtgcc agatatggaa 600
ggcgatcggc aatttaagat tcaaacttta actttgcaaa tcggcaaaca aaacgttttt 660
cggggaacct taattttact cactggttgt tatttagcca tggcaatctg gggcttatgg 720
gcggctatgc ctttaaatac tgctttcttg attgtttccc atttgtgctt attagcctta 780
ctctggtggc ggagtcgaga tgtacactta gaaagcaaaa ccgaaattgc tagtttttat 840
cagtttattt ggaagctatt tttcttagag tacttgctgt atcccttggc tctgtggtta 900
cctaattttt ctaatactat tttttag 927



<210> 37
<211> 308
<212> PRT
<213> Synechocystis sp

<400> 37
Met Ala Thr Ile Gln Ala Phe Trp Arg Phe Ser Arg Pro His Thr Ile
1 5 10 15
Ile Gly Thr Thr Leu Ser Val Trp Ala Val Tyr Leu Leu Thr Ile Leu
20 25 30
Gly Asp Gly Asn Ser Val Asn Ser Pro Ala Ser Leu Asp Leu Val Phe
35 40 45
Gly Ala Trp Leu Ala Cys Leu Leu Gly Asn Val Tyr Ile Val Gly Leu
50 55 60
Asn Gln Leu Trp Asp Val Asp Ile Asp Arg Ile Asn Lys Pro Asn Leu
65 70 75 80
Pro Leu Ala Asn Gly Asp Phe Ser Ile Ala Gln Gly Arg Trp Ile Val
85 90 95
Gly Leu Cys Gly Val Ala Ser Leu Ala Ile Ala Trp Gly Leu Gly Leu
100 105 110
Trp Leu Gly Leu Thr Val Gly Ile Ser Leu Ile Ile Gly Thr Ala Tyr
115 120 125
Ser Val Pro Pro Val Arg Leu Lys Arg Phe Ser Leu Leu Ala Ala Leu
130 135 140
Cys Ile Leu Thr Val Arg Gly Ile Val Val Asn Leu Gly Leu Phe Leu
145 150 155 160
Phe Phe Arg Ile Gly Leu Gly Tyr Pro Pro Thr Leu Ile Thr Pro Ile
165 170 175
Trp Val Leu Thr Leu Phe Ile Leu Val Phe Thr Val Ala Ile Ala Ile
180 185 190
Phe Lys Asp Val Pro Asp Met Glu Gly Asp Arg Gln Phe Lys Ile Gln
195 200 205
Thr Leu Thr Leu Gln Ile Gly Lys Gln Asn Val Phe Arg Gly Thr Leu
210 215 220
Ile Leu Leu Thr Gly Cys Tyr Leu Ala Met Ala Ile Trp Gly Leu Trp
225 230 235 240
Ala Ala Met Pro Leu Asn Thr Ala Phe Leu Ile Val Ser His Leu Cys
245 250 255
Leu Leu Ala Leu Leu Trp Trp Arg Ser Arg Asp Val His Leu Glu Ser
260 265 270
Lys Thr Glu Ile Ala Ser Phe Tyr Gln Phe Ile Trp Lys Leu Phe Phe
275 280 285
Leu Glu Tyr Leu Leu Tyr Pro Leu Ala Leu Trp Leu Pro Asn Phe Ser
290 295 300
Asn Thr Ile Phe
305



<210> 38
<211> 1092
<212> DNA
<213> Synechocystis sp

<400> 38
atgaaatttc cgccccacag tggttaccat tggcaaggtc aatcaccttt ctttgaaggt 60
tggtacgtgc gcctgctttt gccccaatcc ggggaaagtt ttgcttttat gtactccatc 120
gaaaatcctg ctagcgatca tcattacggc ggcggtgctg tgcaaatttt agggccggct 180
acgaaaaaac aagaaaatca ggaagaccaa cttgtttggc ggacatttcc ctcggtaaaa 240
aaattttggg ccagtcctcg ccagtttgcc I— ciy y y a. i— i— acr a cr a a a a t cr tacrcraataac 300
aggcaggcga aacccctact ctccgaagaa ttttttgcca cggtcaagga aggttatcaa 360
atccatcaaa atcagcacca aggacaaatc attcatggcg atcgccattg tcgttggcag 420
ttcaccgtag aaccggaagt aacttggggg agtcctaacc gatttcctcg ggctacagcg 480
ggttggcttt cctttttacc cttgtttgat cccggttggc aaattctttt agcccaaggt 540
agagcgcacg gctggctgaa atggcagagg gaacagtatg aatttgacca cgccctagtt 600
tatgccgaaa aaaattgggg tcactccttt ccctcccgct ggttttggct ccaagcaaat 660
tattttcctg accatccagg actgagcgtc actgccgctg gcggggaacg gattgttctt 720
ggtcgccccg aagaggtagc tttaattggc ttacatcacc aaggtaattt ttacgaattt 780
ggcccgggcc atggcacagt cacttggcaa gtagctccct ggggccgttg gcaattaaaa 840
gccagcaatg ataggtattg ggtcaagttg tccggaaaaa cagataaaaa aggcagttta 900
gtccacactc ccaccgccca gggcttacaa ctcaactgcc gagataccac taggggctat 960
ttgtatttgc aattgggatc tgtgggtcac ggcctgatag tgcaagggga aacggacacc 1020
gcggggctag aagttggagg tgattggggt ttaacagagg aaaatttgag caaaaaaaca 1080
gtgccattct ga 1092

<210> 39
<211> 363
<212> PRT
<213> Synechocystis sp

<400> 39
Met Lys Phe Pro Pro His Ser Gly Tyr His Trp Gln Gly Gln Ser Pro
1 5 10 15
Phe Phe Glu Gly Trp Tyr Val Arg Leu Leu Leu Pro Gln Ser Gly Glu
20 25 30
Ser Phe Ala Phe Met Tyr Ser Ile Glu Asn Pro Ala Ser Asp His His
35 40 45
Tyr Gly Gly Gly Ala Val Gln Ile Leu Gly Pro Ala Thr Lys Lys Gln
50 55 60
Glu Asn Gln Glu Asp Gln Leu Val Trp Arg Thr Phe Pro Ser Val Lys
65 70 75 80
Lys Phe Trp Ala Ser Pro Arg Gln Phe Ala Leu Gly His Trp Gly Lys
85 90 95
Cys Arg Asp Asn Arg Gln Ala Lys Pro Leu Leu Ser Glu Glu Phe Phe
100 105 110
Ala Thr Val Lys Glu Gly Tyr Gln Ile His Gln Asn Gln His Gln Gly
115 120 125
Gln Ile Ile His Gly Asp Arg His Cys Arg Trp Gln Phe Thr Val Glu
130 135 140
Pro Glu Val Thr Trp Gly Ser Pro Asn Arg Phe Pro Arg Ala Thr Ala
145 150 155 160
Gly Trp Leu Ser Phe Leu Pro Leu Phe Asp Pro Gly Trp Gln Ile Leu
165 170 175
Leu Ala Gln Gly Arg Ala His Gly Trp Leu Lys Trp Gln Arg Glu Gln
180 185 190
Tyr Glu Phe Asp His Ala Leu Val Tyr Ala Glu Lys Asn Trp Gly His
195 200 205
Ser Phe Pro Ser Arg Trp Phe Trp Leu Gln Ala Asn Tyr Phe Pro Asp
210 215 220
His Pro Gly Leu Ser Val Thr Ala Ala Gly Gly Glu Arg Ile Val Leu
225 230 235 240
Gly Arg Pro Glu Glu Val Ala Leu Ile Gly Leu His His Gln Gly Asn
245 250 255
Phe Tyr Glu Phe Gly Pro Gly His Gly Thr Val Thr Trp Gln Val Ala
260 265 270
Pro Trp Gly Arg Trp Gln Leu Lys Ala Ser Asn Asp Arg Tyr Trp Val
275 280 285
Lys Leu Ser Gly Lys Thr Asp Lys Lys Gly Ser Leu Val His Thr Pro
290 295 300
Thr Ala Gln Gly Leu Gln Leu Asn Cys Arg Asp Thr Thr Arg Gly Tyr
305 310 315 320
Leu Tyr Leu Gln Leu Gly Ser Val Gly His Gly Leu Ile Val Gln Gly
325 330 335
Glu Thr Asp Thr Ala Gly Leu Glu Val Gly Gly Asp Trp Gly Leu Thr
340 345 350
Glu Glu Asn Leu Ser Lys Lys Thr Val Pro Phe
355 360



<210> 40
<211> 56
<212> DNA
<213> Artificial Sequence

<400> 40
cgcgatttaa atggcgcgcc ctgcaggcgg ccgcctgcag ggcgcgccat ttaaat

<210> 41
<211> 32
<212> DNA
<213> Artificial Sequence

<400> 41
tcgaggatcc gcggccgcaa gcttcctgca gg 32

<210> 42
<211> 32
<212> DNA
<213> Artificial Sequence

<400> 42
tcgacctgca ggaagcttgc ggccgcggat cc 32

<210> 43
<211> 32
<212> DNA
<213> Artificial Sequence

<400> 43
tcgacctgca ggaagcttgc ggccgcggat cc 32

<210> 44
<211> 32
<212> DNA
<213> Artificial Sequence

<400> 44
tcgaggatcc gcggccgcaa gcttcctgca gg 32

<210> 45
<211> 36
<212> DNA
<213> Artificial Sequence

<400> 45
tcgaggatcc gcggccgcaa gcttcctgca ggagct 36

<210> 46
<211> 28
<212> DNA
<213> Artificial Sequence

<400> 46
cctgcaggaa gcttgcggcc gcggatcc 28

<210> 47
<211> 36
<212> DNA
<213> Artificial Sequence

<400> 47
tcgacctgca ggaagcttgc ggccgcggat ccagct

<210> 48
<211> 28
<212> DNA
<213> Artificial Sequence

<400> 48
ggatccgcgg ccgcaagctt cctgcagg

<210> 49
<211> 39
<212> DNA
<213> Artificial Sequence

<400> 49
gatcacctgc aggaagcttg cggccgcgga tccaatgca

<210> 50
<211> 31
<212> DNA
<213> Artificial Sequence

<400> 50
ttggatccgc ggccgcaagc ttcctgcagg t

<210> 51
<211> 41
<212> DNA
<213> Artificial Sequence

<400> 51
ggatccgcgg ccgcacaatg gagtctctgc tctctagttc

<210> 52
<211> 38
<212> DNA
<213> Artificial Sequence

<400> 52
ggatcctgca ggtcacttca aaaaaggtaa cagcaagt

<210> 53
<211> 45
<212> DNA
<213> Artificial Sequence

<400> 53
ggatccgcgg ccgcacaatg gcgttttttg ggctctcccg tgttt

<210> 54
<211> 40
<212> DNA
<213> Artificial Sequence

<400> 54
ggatcctgca ggttattgaa aacttcttcc aagtacaact

<210> 55
<211> 38
<212> DNA
<213> Artificial Sequence

<400> 55
ggatccgcgg ccgcacaatg tggcgaagat ctgttgtt

<210> 56
<211> 37
<212> DNA
<213> Artificial Sequence

<400> 56
ggatcctgca ggtcatggag agtagaagga aggagct

<210> 57
<211> 50
<212> DNA
<213> Artificial Sequence

<400> 57
ggatccgcgg ccgcacaatg gtacttgccg aggttccaaa gcttgcctct

<210> 58
<211> 38
<212> DNA
<213> Artificial Sequence

<400> 58
ggatcctgca ggtcacttgt ttctggtgat gactctat

<210> 59
<211> 38
<212> DNA
<213> Artificial Sequence

<400> 59
ggatccgcgg ccgcacaatg acttcgattc tcaacact

<210> 60
<211> 36
<212> DNA
<213> Artificial Sequence

<400> 60
ggatcctgca ggtcagtgtt gcgatgctaa tgccgt

<210> 61
<211> 22
<212> DNA
<213> Artificial Sequence

<400> 61
taatgtgtac attgtcggcc tc

<210> 62
<211> 60
<212> DNA
<213> Artificial Sequence

<400> 62
gcaatgtaac atcagagatt ttgagacaca acgtggcttt ccacaattcc ccgcaccgtc

<210> 63
<211> 22
<212> DNA
<213> Artificial Sequence

<400> 63
aggctaataa gcacaaatgg ga

<210> 64 
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<211> 63 
<212> DNA 

<213> Artifical Sequence 

5 <400> 64 

ggtatgagtc agcaacacct tcttcacgag 
ccc 

<210> 65 
10 <211> 26 
<212> DNA 

<213> Artifical Sequence 
<400> 65 

15 ggatccatgg ttgcccaaac cccatc 

<210> 66 

<211> 61 

<212> DNA 

20 <213> Artifical Sequence 

<400> 66 

gcaatgtaac atcagagatt ttgagacaca 
c 

25 

<210> 67 
<211> 25 
<212> DNA 

<213> Artifical Sequence 

30 

<400> 67 

gaattctcaa agccagccca gtaac 

<210> 68 
35 . <211> 63 
<212> DNA 

<213> Artifical Sequence 
<400> 68 

40 ggtatgagtc agcaacacct tcttcacgag 
ccc 

<210> 69 
<211> 23 
45 <212> DNA 

<213> Artifical Sequence 



PCT/USOO/10368 



gcagacctca gcggaattgg tttaggttat 



acgtggcttt gggtaagcaa caatgaccgg 



gcagacctca gcgggtgcga aaagggtttt 
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<400> 69 

ccagtggttt aggctgtgtg gtc 

5 <210> 70 

<211> 21 

<212> DNA 

<213> Artifical Sequence 

10 <400> 70 

ctgagttgga tgtattggat c 

<210> 71 
<211> 28 
15 <212> DNA 

<213> Artifical Sequence 

<400> 71 

ggatccatgg ttacttcgac aaaaatcc 

20 

<210> 72 
<211> 60 
<212> DNA 

<213> Artifical Sequence 

25 

<400> 72 

gcaatgtaac atcagagatt ttgagacaca 

<210> 73 
30 <211> 28 
<212> DNA 

<213> Artifical Sequence 
<400> 73 

35 gaattcttaa cccaacagta aagttccc 

<210> 74 

<211> 63 

<212> DNA 

40 <213> Artifical Sequence 

<400> 74 

ggtatgagtc agcaacacct tcttcacgag 
■ atg 

45 

<210> 75 



acgtggcttt gctaggcaac cgcttagtac 



gcagacctca gcgccggcat tgtcttttac 
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<211> 20 
<212> DNA 

<213> Artifical Sequence 

5 <400> 75 

ggaacccttg cagccgcttc 

<210> 76
<211> 22
<212> DNA
<213> Artificial Sequence

<400> 76
gtatgcccaa ctggtgcaga gg

<210> 77
<211> 28
<212> DNA
<213> Artificial Sequence

<400> 77
ggatccatgt ctgacacaca aaataccg

<210> 78
<211> 62
<212> DNA
<213> Artificial Sequence

<400> 78
gcaatgtaac atcagagatt ttgagacaca acgtggcttt cgccaatacc agccaccaac 60
ag 62

<210> 79
<211> 27
<212> DNA
<213> Artificial Sequence

<400> 79
gaattctcaa atccccgcat ggcctag

<210> 80
<211> 65
<212> DNA
<213> Artificial Sequence

<400> 80
ggtatgagtc agcaacacct tcttcacgag gcagacctca gcggcctacg gcttggacgt 60
gtggg 65

<210> 81
<211> 21
<212> DNA
<213> Artificial Sequence

<400> 81
cacttggatt cccctgatct g 21

<210> 82
<211> 21
<212> DNA
<213> Artificial Sequence

<400> 82
gcaatacccg cttggaaaac g 21

<210> 83
<211> 29
<212> DNA
<213> Artificial Sequence

<400> 83
ggatccatga ccgaatcttc gcccctagc 29

<210> 84
<211> 61
<212> DNA
<213> Artificial Sequence

<400> 84
gcaatgtaac atcagagatt ttgagacaca acgtggcttt caatcctagg tagccgaggc 60
g 61



<210> 85
<211> 27
<212> DNA
<213> Artificial Sequence

<400> 85
gaattcttag cccaggccag cccagcc 27

<210> 86
<211> 66
<212> DNA
<213> Artificial Sequence

<400> 86
ggtatgagtc agcaacacct tcttcacgag gcagacctca gcggggaatt gatttgttta
attacc

<210> 87
<211> 21
<212> DNA
<213> Artificial Sequence

<400> 87
gcgatcgcca ttatcgcttg g

<210> 88
<211> 24
<212> DNA
<213> Artificial Sequence

<400> 88
gcagactggc aattatcagt aacg

<210> 89
<211> 25
<212> DNA
<213> Artificial Sequence

<400> 89
ccatggattc gagtaaagtt gtcgc

<210> 90
<211> 25
<212> DNA
<213> Artificial Sequence

<400> 90
gaattcactt caaaaaaggt aacag
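
A declared <211> length can be cross-checked against the residues printed under the matching <400> identifier. The following is a minimal sketch in Python; the helper names `residue_count` and `length_matches` are illustrative assumptions and are not part of the listing format.

```python
def residue_count(entry: str) -> int:
    """Count nucleotide letters, ignoring spaces, line breaks and position numbers."""
    return sum(1 for ch in entry if ch.isalpha())


def length_matches(declared_length: int, entry: str) -> bool:
    """Return True when the <211> value equals the number of residues in the <400> text."""
    return residue_count(entry) == declared_length


# SEQ ID NO: 89 from the listing, declared as <211> 25.
seq_89 = "ccatggattc gagtaaagtt gtcgc"
print(length_matches(25, seq_89))
```

Trailing position numbers such as "60" or "65" are digits and are therefore skipped by the count, so a line can be pasted in as printed.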

<210> 91
<211> 4550
<212> DNA
<213> Arabidopsis sp

<400> 91
attttacacc aatttgatca cttaactaaa ttaattaaat tagatgatta tcccaccata 60
tttttgagca ttaaaccata aaaccatagt tataagtaac tgttttaatc gaatatgact 120
cgattaagat taggaaaaat ttataaccgg taattaagaa aacattaacc gtagtaaccg 180
taaatgccga ttcctccctt gtctaaaaga cagaaaacat atattttatt ttgccccata 240
tgtttcactc tatttaattt caggcacaat acttttggtt ggtaacaaaa ctaaaaagga 300
caacacgtga tacttttcct cgtccgtcag tcagattttt tttaaactag aaacaagtgg 360
caaatctaca ccacattttt tgcttaatct attaacttgt aagttttaaa ttcctaaaaa 420
agtctaacta attcttctaa tataagtaca ttccctaaat ttcccaaaaa gtcaaattaa 480
taattttcaa aatctaatct aaatatctaa taattcaaaa tcattaaaaa gacacgcaac 540
aatgacacca attaatcatc ctcgacccac acaattctac agttctcatg ctaaaccata 600
ttttttgctc tctgttcctt caaaatcatt tctttctctt ctttgattcc caaagatcac 660
ttctttgtct ttgatttttg attttttttc tctctggcgt gaaggaagaa gctttatttc 720
atggagtctc tgctctctag ttcttctctt gtttccgctg gtaaatctcg tccttttctg 780
gtttcaggtt ttatttgttg tttaggtttc gtttttgtga ttcagaacca tacaaaaagt 840
ttgaactttt ctgaatataa aataaggaaa aagtttcgat ttttataatg aattgtttac 900
tagatcgaag taggtgacaa aggttattgt gtggagaagc ataatttctg ggcttgactt 960
tgaattttgt ttctcatgca tgcaacttat caatcagctg gtgggttttg ttggaagaag 1020
cagaatctaa agctccactc tttatcaggt tcgttagggt tttatgggtt tttgaaatta 1080
aatactcaat catcttagtc tcattattct attggttgaa tcacattttc taatttggaa 1140
tttatgagac aatgtatgtt ggacttagtt gaagttcttc tctttggtta tagttgaagt 1200
gttactgatg ttgtttagct ctttacacca atatatacac ccaattttgc agaaatccga 1260
gttctgcgtt gtgattcgag taaagttgtc gcaaaaccga agtttaggaa caatcttgtt 1320
aggcctgatg gtcaaggatc ttcattgttg ttgtatccaa aacataagtc gagatttcgg 1380
gttaatgcca ctgcgggtca gcctgaggct ttcgactcga atagcaaaca gaagtctttt 1440
agagactcgt tagatgcgtt ttacaggttt tctaggcctc atacagttat tggcacagtt 1500
aagtttctct ttaaaaatgt aactctttta aaacgcaatc tttcagggtt ttcaaggaga 1560
taacattagc tctgtgattg gatttgcagg tgcttagcat tttatctgta tctttcttag 1620
cagtagagaa ggtttctgat atatctcctt tacttttcac tggcatcttg gaggtaatga 1680
atatataaca cataatgacc gatgaagaag atacattttt ttcgtctctc tgtttaaaca 1740
attgggtttt gttttcaggc tgttgttgca gctctcatga tgaacattta catagttggg 1800
ctaaatcagt tgtctgatgt tgaaatagat aaggtaacat gcaaattttc ttcatatgag 1860
ttcgagagac tgatgagatt aatagcagct agtgcctaga tcatctctat gtgggttttt 1920
gcaggttaac aagccctatc ttccattggc atcaggagaa tattctgtta acaccggcat 1980
tgcaatagta gcttccttct ccatcatggt atggtgccat tttcacaaaa tttcaacttt 2040
tagaattcta taagttactg aaatagtttg ttataaatcg ttatagagtt tctggcttgg 2100
gtggattgtt ggttcatggc cattgttctg ggctcttttt gtgagtttca tgctcggtac 2160
tgcatactct atcaatgtaa gtaagtttct caatactaga atttggctca aatcaaaatc 2220
tgcagtttct agttttaggt taatgaggtt ttaataactt acttctacta caaacagttg 2280
ccacttttac ggtggaaaag atttgcattg gttgcagcaa tgtgtatcct cgctgtccga 2340
gctattattg ttcaaatcgc cttttatcta catattcagg tactaaacca ttttccttat 2400
gttttgtagt tgttttcatc aaaatcactt ttatattact aaagctgtga aactttgttg 2460
cagacacatg tgtttggaag accaatcttg ttcactaggc ctcttatttt cgccactgcg 2520
tttatgagct ttttctctgt cgttattgca ttgtttaagg taaacaaaga tggaaaaaga 2580
ttaaatctat gtatacttaa agtaaagcat tctactgtta ttgatgagaa gttttctttt 2640
ttggttggat gcaggatata cctgatatcg aaggggataa gatattcgga atccgatcat 2700
tctctgtaac tctgggtcag aaacgggtac gatatctaaa ctaaagaaat tgttttgact 2760
caagtgttgg attaagatta cagaagaaag aaaactgttt ttgtttcttg caaaattcag 2820
gtgttttgga catgtgttac actacttcaa atggcttacg ctgttgcaat tctagttgga 2880
gccacatctc cattcatatg gagcaaagtc atctcggtaa caatctttct ttacccatcg 2940
aaaactcgct aattcatcgt ttgagtggta ctggtttcat tttgttccgt tctgttgatt 3000
ttttttcagg ttgtgggtca tgttatactc gcaacaactt tgtgggctcg agctaagtcc 3060
gttgatctga gtagcaaaac cgaaataact tcatgttata tgttcatatg gaaggttaga 3120
ttcgtttata aatagagtct ttactgcctt tttatgcgct ccaatttgga attaaaatag 3180
cctttcagtt tcatcgaatc accattatac tgataaattc tcatttctgc atcagctctt 3240
ttatgcagag tacttgctgt tacctttttt gaagtgactg acattagaag agaagaagat 3300
ggagataaaa gaataagtca tcactatgct tctgttttta ttacaagttc atgaaattag 3360
gtagtgaact agtgaattag agttttattc tgaaacatgg cagactgcaa aaatatgtca 3420
aagatatgaa tttctgttgg gtaaagaagt ctctgcttgg gcaaaatctt aaggttcggt 3480
gtgttgatat aatgctaagc gaagaaatcg attctatgta gaaatttccg aaactatgtg 3540
taaacatgtc agaacatctc cattctatat cttcttctgc aagaaagctc tgtttttatc 3600
acctaaactc tttatctctg tgtagttaag atatgtatat gtacgtgact acattttttt 3660
gttgatgtaa tttgcagaac gtatggattt ttgttagaaa gcatgagttc gaaagtatat 3720
gtttatatat atggataatt cagacctaac gtcgaagctc acaagcataa attcactact 3780
atagtttgct ctgtaataga tagttccatt gatgtcttga aactgtacgt aactgcctgg 3840
gcgttttgtg gttgatactg actactgagt gttctttgtg agtgttgtaa gtatacaaga 3900
agaagaatat aggctcacgg gaacgactgt ggtggaagat gaaatggaga tcatcacgta 3960
gcggctttgc caaagaccga gtcacgatcg agtctatgaa gtctttacag ctgctgatta 4020
tgattgacca ttgcttagag acgcattgga atcttactag ggacttgcct gggagtttct 4080
tcaagtacgt gtcagatcat acgatgtagg agatttcacg gctttgatgt gtttgtttgg 4140
agtcacaatg cttaatgggc ttattggccc aataatagct agctcttttg ctttagccgt 4200
ttcgtttgtc ccctggtggt gagtattatt agggtatggt gtgaccaaag tcaccagacc 4260
tagagtgaat ctagtagagt cctagaccat ggtccatggc ttttatttgt aatttgaaaa 4320
atgaacaatt ctttttgtaa ggaaaacttt tatatagtag acgtttacta tatagaaact 4380
agttgaacta acttcgtgca attgcataat aatggtgtga aatagagggt gcaaaactca 4440
ataaacattt cgacgtacca agagttcgaa acaataagca aaatagattt ttttgcttca 4500
gactaatttg tacaatgaat ggttaataaa ccattgaagc ttttattaat 4550



<210> 92
<211> 4450
<212> DNA
<213> Arabidopsis sp

<400> 92
tttaggttac aaaatcaatg atattgcgta tgtcaactat aaaagccaaa agtaaagcct 60
cttgtttgac cagaaggtca tgatcattgt atacatacag ccaaactacc tcctggaaga 120
aaagacatgg atcccaaaca acaacaatag cttcttttac aagaaccagt agtaactagt 180
cactaatcta aaagagttaa gtttcagctt ttctggcaat ggctccttga tcatttcaat 240
cctgaaggag acccactttg tagcaagacc atgtcctctg tttcacttac agtgtgtctc 300
aaaagtctac ttcaattctt catatatagg ttcctcacac tacagcttca tcctcattcg 360
ttgacagaga gagagtcttt attgaaaact tcttccaagt acaactccac taaatataat 420
agcaccaaac cacttgttcg acacaaatct gtacagatat aaaaacacta ttaggttttc 480
caaggcaaat cacataattg gattgtgaaa gagtacaaaa gataaaccca aattttcata 540
ctttctactg cagtcagcac cagatgataa gtcagctgtc cctatttgcc atcctaactg 600
tcctgatgca gcggccagtg atgcgtaata ttgccaccct taatcattag agcgagaaac 660
aaaaagaatc aaaagacagt aaatggaatt aggaatcaca aatgagtcct tgtaaagttt 720
attgagtacc gagatctgca ctgaatccag aaagtgcaag aaaacctatg gatgctgtgc 780
caaatccagt taaccaaagc tttgtattat caccgaatct aagggctgtt gacttaacac 840
caacttttac atcatcttct ttgtcctgga gacacaatat attagacatt agtccatgga 900
aaaaaaatga tttaacctag aatatctcaa aattacttgc ataaaaactg aacttgagct 960
gaaattttgg gttcgtagct tgtggcatat actatttcat tttcaatggg ccacaaaggt 1020
aactttcttt tctcacttct gttgcaaacg ggaagacttt tatggggcta actcttcact 1080
taaagtatag aaatcagatg gaaaaggtgg gagatcaggg taattttctt ctttatgatt 1140
gacaaaagtc gaacatcgaa atggatgcat ttgcatgaga catgaaacaa aagctgaaaa 1200
agaaatctgt ggtggtgaag ctagaaaaag aaaacaaagc aagcaatatg cacacattga 1260
gattaactac tttgctactg gtcataatca aatagatttt gaagctaaaa aataaaaagt 1320
gaatatacct gatgtgcata aatagtatca taaacaaggg tccagcagac tccggagaga 1380
tagagaggga gtacaataga tggtgctatg cttcctttaa ctgcagtcca tcctaacaat 1440
gctccccagt ttatggtcaa acctaaaaag gcttgaggct gcaattataa aaacgaatca 1500
atcataagaa aatcagaaaa tatataatgt ctaactttga gaagccagaa tagatttaaa 1560
ttacccaaaa tgtaaacctc ttcataagtg ggtaggaaaa gacaagtaac aaagatgaag 1620
cccctaaaac acggctgcag aatatacata ctgaaatgag ctcaagtaga aaagaatttg 1680
atcacaaaac taaagacaag acctgagaac atatcttcag aatttgggcc aactacataa 1740
gggtgaacca tatgtgtatg tgaattttta aacaaacact tgcaaatacg cgactttagg 1800
gcaagtaaaa aatccaaaca aacctgtaat tgttaagttg gagaagaatc cctaagccta 1860
aaagcaactg cagcccgaga aatccaatcc cttgaaatgg tgtcaaaaga ccactggcga 1920
taggtcttag ttttgtacga tcaacctgga tataaaagaa atttgtaaga caacataatc 1980
taaaacaaaa caaccataca aaatcttgag ctttacatac aagcaaccca tctttgttta 2040
tggaagaatg aatccagtta catgaatgct gtgtatctac cctaactact aaacacatat 2100
ttcaatcgaa aaacatattc caccttcacc atatctaaca cctgaagtct ttcacttttt 2160
gaacgaagtc atcagaacat gcagataagc tattacccaa aacagagata tgactggaaa 2220
tgttgtcgta aattgatcca acatagaaaa atcaagacca gttccagatg tcaaagcaat 2280
aacactttcc caccatggtt acagaaacca tagttacaca aaacatgttt cctaaaccaa 2340
catactaaag ggatatataa atttgacatc actttatcac cataccataa gatagcttaa 2400
aaacaaactg acctttgtat ctatgtcctg atcaagcaga tcatttatag tacaaccagc 2460
acctctaaga agtaatgctc cgcaaccaaa taaagccata tatttaaaac ttggaaggct 2520
tccaggatca gcagccaacg caatcgacct atacaacaat gatggagatt cagagtatcg 2580
atctatttac atagctctgg aactagatcc atgacgaaac atggaacatc gttataatat 2640
ctaaagactt ccaaacagat tcctgagtaa gaaacccagt ggaactatag tactgtaaca 2700
tatataaaat caaagaaaac tcaggtttat agcattatcc aatcctgatt tctgccaatc 2760
cttaaccact ctcccatgct atcaaaaacc tcagctcaag atcatactac ctaattgcct 2820
atgagctctt gggaagatca ttatggattt gataactgaa aaaagtaaca gagaaatagc 2880
agactgcaag aactactcca aacttctcca ctgatatgta tgtagtctaa caataataaa 2940
cagacataaa ttcttttatc aagcttcaag agcaagttag tcagaaaaca tcacagccaa 3000
accaaccagg aaaacacata actttatcac ataaaactaa atttaatgta atctgactta 3060
acataaacca tcctttggga cgaaaggaaa ctatataaac atgcagtctt tctttccctc 3120
agctattctt tcggatggat tataatgaat ctcaaaagtg aaatgtcttg attctcagct 3180
acattactca aaggcgaaga taaacttacc acatacaagg ccacgcaagc aaccaagttc 3240
caatgggttt atccaatcga gcaagcttag cataacctct aacttcttct ggtaaataca 3300
aatctatcca agaagcttcc ttaacaacaa caccatcact cttctcctta tcatctttct 3360
tcggctttcc ctccaaaacc gaagaagacg acgacattcc acaaattaat ctgtaattcc 3420
aaccaacacc aaaaaacttc tcctgatgca attctcttcc tttactccat acttggtaat 3480
tatcattcca tgaaggataa cacttagtga aaggatttgt gtaatgggta gtcacaggat 3540
tggacaagga tttatgttgt gattgcaaaa gagcagagga agaagatgga gttacggaga 3600
cggaagattt caacaaccgt cttgaaacac gggagagccc aaaaaacgcc atctttgaga 3660
gaaattgttg cctggaagaa acaaagactt gagatttcaa acgtaagtga attcttacga 3720
acgaaagcta acttctcaag agaatcagat tagtgattcc tcaaaaacaa acaaaactat 3780
ctaatttcag tttcgagtga tgaagcctta agaatctaga acctccatgg cgtttctaat 3840
ctctcagaga taatcgaatt ccttaaacaa tcaaagctta gaaagagaag aacaacaaca 3900
acaacaaaaa aaatcagatt aacaaccgac cagagagcaa cgacgacgcc ggcgagaaag 3960
agcacgtcgt ctcggagcaa gacttcttct ccagtaaccc ggatggatcg ttaatgggcc 4020
tgtagattat tatatttggg ccgaaacaat tgggtcagca aaaacttggg ggataatgaa 4080
gaaacacgta cagtatgcat ttaggctcca aattaattgg ccatataatt cgaatcagat 4140
aaactaatca acccctacct tacttatttc tcactgtttt tatttctacc ttagtagttg 4200
aagaaacact tttatttatc ttttcgggac ccaaatttga taggatcggg ccattactca 4260
tgagcgtcag acacatatta gccttatcag attagtgggg taaggttttt ttaattcggt 4320
aagaagcaac aatcaatgtc ggagaaatta aagaatctgc atgggcgtgg cgtgatgata 4380
tgtgcatatg gagtcagttg ccgatcatat ataactattt ataaactaca tataaagact 4440
actaatagat 4450



<210> 93
<211> 2850
<212> DNA
<213> Arabidopsis sp

<400> 93
aattaaaatt tgagcggtct aaaccattag accgtttaga gatccctcca acccaaaata 60
gtcgattttc acgtcttgaa catatattgg gccttaatct gtgtggttag taaagacttt 120
tattggtcaa agaaaaacaa ccatggccca acatgttgat acttttattt aattatacaa 180
gtacccctga attctctgaa atatatttga ttgacccaga tattaatttt aattatcatt 240
tcctgtaaaa gtgaaggagt caccgtgact cgtcgtaatc tgaaaccaat ctgttcatat 300
gatgaagaag tttctctcgt tctcctccaa cgcgtagaaa attctgacgg cttaacgatg 360
tggcgaagat ctgttgttta tcgtttctct tcaagaatct ctgtttcttc ttcgttacca 420
aaccctagac tgattccttg gtcccgcgaa ttatgtgccg ttaatagctt ctcccagcct 480
ccggtctcga cggaatcaac tgctaagtta gggatcactg gtgttagatc tgatgccaat 540
cgagtttttg ccactgctac tgccgccgct acagctacag ctaccaccgg tgagatttcg 600
tctagagttg cggctttggc tggattaggg catcactacg ctcgttgtta ttgggagctt 660
tctaaagcta aacttaggta tgtgtttact tttcttttct catgaaaaat ctgaaaattt 720
ccaattgttg gattcttaaa ttctcatttg ttttatggtt gtagtatgct tgtggttgca 780
acttctggaa ctgggtatat tctgggtacg ggaaatgctg caattagctt cccggggctt 840
tgttacacat gtgcaggaac catgatgatt gctgcacctg ctaattcctt gaatcaggtc 900
attgaaatgt tgagaagttc ataaatttcg aatccttgtt gtgtttatgt agttgatctt 960
gcttgcttat gtttatgtag ttgaaaagtt taaaaatttc taatccttgg tagttgatct 1020
cgcttgtttg ttttttcatt ttctagattt ttgagataag caatgattct aagatgaaaa 1080
gaacgatgct aaggccattg ccttcaggac gtattagtgt tccacacgct gttgcatggg 1140
ctactattgc tggtgcttct ggtgcttgtt tgttggccag caaggtgaat gtttgttttt 1200
ttatatgtga tttctttgtt ttatgaatgg gtgattgaga gattatggat ctaaactttt 1260
gcttccacga caaggttatt gcagactaat atgttggctg ctggacttgc atctgccaat 1320
cttgtacttt atgcgtttgt ttatactccg ttgaagcaac ttcaccctat caatacatgg 1380
gttggcgctg ttgttggtgc tatcccaccc ttgcttgggt aaatttttgt tccttttctt 1440
ctttatttta gcagattctg ttttgttgga tactgctttt aattcaaaat gtagtcatgg 1500
ttcaccaatt ctatgcttat ctattttgtg tgttgtcagg tgggcggcag cgtctggtca 1560
gatttcatac aattcgatga ttcttccagc tgctctttac ttttggcaga tacctcattt 1620
tatggccctt gcacatctct gccgcaatga ttatgcagct ggagggtaag accatatggt 1680
gtcatatgag attagaatgt ctccttccat gtagtgttga tcttgaacta gttcaatttc 1740
gtggaatgat cagagtgtcc tagatagtgt cacagcagtc gacattttag tggctagata 1800
atgagttctt tccgttagag ataaacattc gcgaacattg tttccagctt ccgcgaccca 1860
acttctgatt ttgtttcttg gtaccttgtt ttcagttaca agatgttgtc actctttgat 1920
ccgtcaggga agagaatagc agcagtggct ctaaggaact gcttttacat gatccctctc 1980
ggtttcatcg cctatgactg tgagtcttgt agattcatct tttttttgta gtttattgac 2040
tgcattgctg tatctgattt ttgctgttcc ttccaatttt tgtgacaggg gggttaacct 2100
caagttggtt ttgcctcgaa tcaacacttc tcacactagc aatcgctgca acagcatttt 2160
cattctaccg agaccggacc atgcataaag caaggaaaat gttccatgcc agtcttctct 2220
tccttcctgt tttcatgtct ggtcttcttc tacaccgtgt ctctaatgat aatcagcaac 2280
aactcgtaga agaagccgga ttaacaaatt ctgtatctgg tgaagtcaaa actcagaggc 2340
gaaagaaacg tgtggctcaa cctccggtgg cttatgcctc tgctgcaccg tttcctttcc 2400
tcccagctcc ttccttctac tctccatgat aacctttaag caagctattg aatttttgga 2460
aacagaaatt aaaaaaaaaa tctgaaaagt tcttaagttt aatctttggt taataatgaa 2520
gtggagaacg catacaagtt tatgtatttt ttctcatctc cacataattg tattttttct 2580
ctaagtatgt ttcaaatgat acaaaataca tactttatca attatctgat caaattgatg 2640
aatttttgag ctttgacgtg ttaggtctat ctaataaacg tagtaacgaa tttggttttg 2700
gaaatgaaat ccgataaccg atgatggtgt agagttaaac gattaaaccg ggttggttaa 2760
aggtctcgag tctcgacggc tgcggaaatc ggaaaatcac gattgaggac tttgagctgc 2820
cacgaagatg gcgatgaggt tgaaatcaat 2850



<210> 94
<211> 3660
<212> DNA
<213> Arabidopsis sp

<400> 94
tatttgtatt tttattgtta aattttatga tttcacccgg tatatatcat cccatattaa 60
tattagattt attttttggg ctttatttgg gttttcgatt taaactgggc ccattctgct 120
tcaatgaaac cctaatgggt tttgtttggg ctttggattt aaaccgggcc cattctgctt 180
caatgaaggt cctttgtcca acaaaactaa catccgacac aactagtatt gccaagagga 240
tcgtgccaca tggcagttat tgaatcaaag gccgccaaaa ctgtaacgta gacattactt 300
atctccggta acggacaacc actcgtttcc cgaaacagca actcacagac tcacaccact 360
ccagtctccg gcttaactac caccagagac gattctctct tccgtcggtt ctatgacttc 420
gattctcaac actgtctcca ccatccactc ttccagagtt acctccgtcg atcgagtcgg 480
agtcctctct cttcggaatt cggattccgt tgagttcact cgccggcgtt ctggtttctc 540
gacgttgatc tacgaatcac ccggtagtta gcattctgtt ggatagattg atgaatgttt 600
tcttcgattt tttttttact gatcttgttg tggatctctc gtagggcgga gatttgttgt 660
gcgtgcggcg gagactgata ctgataaagg tatgattttt tagttgtttt tattttctct 720
ctcttcaaaa ttctcttttc aaacactgtg gcgtttgaat ttccgacggc agttaaatct 780
cagacacctg acaaggcacc agccggtggt tcaagcatta accagcttct cggtatcaaa 840
ggagcatctc aagaaactgt aattttgttc atctcctcag aatcttttaa attatcatat 900
ttgtggataa tgatgtgtta gtttaggaat tttcctacta aaggtaatct cttttgagga 960
caagtcttgt ttttagctta gaaatgatgt gaaaatgttg tttgttagct aaaaagagtt 1020
tgttgttata ttctgtattc agaataaatg gaagattcgt cttcagctta caaaaccagt 1080
cacttggcct ccactggttt ggggagtcgt ctgtggtgct gctgcttcag gtaatcatac 1140
gaacctcttt tggatcatgc aatactgtac agaaagtttt ttcattttcc ttccaattgt 1200
ttcttctggc agggaacttt cattggaccc cagaggatgt tgctaagtcg attctttgca 1260
tgatgatgtc tggtccttgt cttactggct atacacaggt ctggttttac acaacaaaaa 1320
gctgacttgt tcttattcta gtgcatttgc ttggtgctac aataacctag acttgtcgat 1380
ttccagacaa tcaacgactg gtatgataga gatatcgacg caattaatga gccatatcgt 1440
ccaattccat ctggagcaat atcagagcca gaggtaactg agacagaaca ttgtgagctt 1500
ttatctcttt tgtgattctg atttctcctt actccttaaa atgcaggtta ttacacaagt 1560
ctgggtgcta ttattgggag gtcttggtat tgctggaata ttagatgtgt gggtaagttg 1620
gcccttctga cattaactag tacagttaaa gggcacatca gatttgctaa aatcttccct 1680
tatcaggcag ggcataccac tcccactgtc ttctatcttg ctttgggagg atcattgcta 1740
tcttatatat actctgctcc acctcttaag gtaagtttta ttcctaactt ccactctcta 1800
gtgataagac actccatcca agttttggag ttttgaatat cgatatctga actgatctca 1860
ttgcagctaa aacaaaatgg atgggttgga aattttgcac ttggagcaag ctatattagt 1920
ttgccatggt aagatatctc gtgtatcaat aatatatggc gttgttctca tctcattgat 1980
ttgtttcttg ctcacttgac tgataggtgg gctggccaag cattgtttgg cactcttacg 2040
ccagatgttg ttgttctaac actcttgtac agcatagctg gggtactctt ttggcaaacc 2100
ttttatgttg cttttttcgt tatctgttgt aatatgctct tgcttcatgt tgtacctttg 2160
tgataatgca gttaggaata gccattgtta acgacttcaa aagtgttgaa ggagatagag 2220
cattaggact tcagtctctc ccagtagctt ttggcaccga aactgcaaaa tggatatgcg 2280
ttggtgctat agacattact cagctttctg ttgccggtat gtactatcca ctgtttttgt 2340
gcagctgtgg cttctatttc ttttccttga tcttatcaac tggatattca ccaatggtaa 2400
agcacaaatt aatgaagctg aatcaacaaa ggcaaaacat aaaagtacat tctaatgaaa 2460
tgagctaatg aagaggaggc atctactttt atgtttcatt agtgtgattg atggattttc 2520
atttcatgct tctaaaacaa gtattttcaa cagtgtcatg aaataacaga acttatatct 2580
tcatttgtac ttttactagt ggatgagtta cacaatcatt gttatagaac caaatcaaag 2640
gtagagatca tcattagtat atgtctattt tggttgcagg atatctatta gcatctggga 2700
aaccttatta tgcgttggcg ttggttgctt tgatcattcc tcagattgtg ttccaggtaa 2760
agacgttaac agtctcacat tataattaat caaattcttg tcactcgtct gattgctaca 2820
ctcgcttcta taaactgcag tttaaatact ttctcaagga ccctgtcaaa tacgacgtca 2880
agtaccaggt aagtcaactt agtacacatg tttgtgttct tttgaaatat ctttgagagg 2940
tctcttaatc agaagttgct tgaaacactc atcttgatta caggcaagcg cgcagccatt 3000
cttggtgctc ggaatatttg taacggcatt agcatcgcaa cactgaaaaa ggcgtatttt 3060
gatggggttt tgtcgaaagc agaggtgttg acacatcaaa tgtgggcaag tgatggcatc 3120
aactagttta aaagattttg taaaatgtat gtaccgttat tactagaaac aactcctgtt 3180
gtatcaattt agcaaaacgg ctgagaaatt gtaattgatg ttaccgtatt tgcgctccat 3240
ttttgcattt cctgctcata tcgaggattg gggtttatgt tagttctgtc acttctctgc 3300
tttcagaatg tttttgtttt ctgtagtgga ttttaactat tttcatcact ttttgtactg 3360
attctaaaca tgtatccaca taaaaacagt aatatacaaa aatgatactt cctcaaactt 3420
tttataatct aaatctaaca actagctagt aacccaacta acttcataca attaatttga 3480
gaaactacaa agactagact atacatatgt tatttaacaa cttgaaactg tgttattact 3540
acctgatttt tttctattct acagccattt gatatgctgc aatcttaaca tatcaagtct 3600
cacgttgttg gacacaacat actatcacaa gtaagacacg aagtaaaacc aaccggcaac 3660

(57) Abstract: A method for the production of plants with an increased
tocopherol and tocotrienol content by overexpression of a gene coding
for 2-methyl-6-phytylhydroquinone methyltransferase.



1a, α-Tocopherol: R1 = R2 = R3 = CH3
1b, β-Tocopherol [148-03-8]: R1 = R3 = CH3, R2 = H
1c, γ-Tocopherol [54-28-4]: R1 = H, R2 = R3 = CH3
1d, δ-Tocopherol [119-13-1]: R1 = R2 = H, R3 = CH3

2a, α-Tocotrienol [1721-51-3]: R1 = R2 = R3 = CH3
2b, β-Tocotrienol [490-23-3]: R1 = R3 = CH3, R2 = H
2c, γ-Tocotrienol [14101-61-2]: R1 = H, R2 = R3 = CH3
2d, δ-Tocotrienol [25612-59-3]: R1 = R2 = H, R3 = CH3
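
The substitution patterns of the four forms can be written down as a small lookup table. The following Python sketch is only an illustration; the names `METHYLATION`, `classify` and `methyl_groups` are assumptions introduced here, and the same pattern applies to the tocopherols (1a-1d) and the tocotrienols (2a-2d).

```python
# True = CH3, False = H, in the order (R1, R2, R3) used in the compound list.
METHYLATION = {
    "alpha": (True, True, True),
    "beta": (True, False, True),
    "gamma": (False, True, True),
    "delta": (False, False, True),
}


def classify(r1: bool, r2: bool, r3: bool) -> str:
    """Return the Greek-letter form that matches a given substitution pattern."""
    for name, pattern in METHYLATION.items():
        if pattern == (r1, r2, r3):
            return name
    raise ValueError("pattern does not correspond to a listed compound")


def methyl_groups(name: str) -> int:
    """Number of ring methyl groups at R1 to R3 for the given form."""
    return sum(METHYLATION[name])


print(classify(False, True, True))
print(methyl_groups("alpha"))
```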

α-Tocopherol is of great economic importance.

The development of crop plants with an increased content of tocopherols
and tocotrienols by classical breeding methods has its limits.

A sensible alternative is the genetic engineering approach, for example
isolating the essential biosynthesis genes that code for tocopherol
synthesis capacity and transferring them into crop plants in a targeted
manner. This approach requires that the biosynthesis and its regulation
are known and that genes which influence biosynthetic capacity are
identified.

Isoprenoids or terpenoids comprise various classes of lipid-soluble
molecules and are formed partly or entirely from C5 isoprene units.
Pure prenyl lipids (e.g. carotenoids) consist of carbon skeletons
derived exclusively from isoprene units, whereas mixed prenyl lipids
(e.g. chlorophylls, tocopherols and vitamin K) have an isoprenoid side
chain attached to an aromatic nucleus.

The starting point for the biosynthesis of prenyl lipids is three
acetyl-CoA units, which are converted via β-hydroxymethylglutaryl-CoA
(HMG-CoA) and mevalonate into the starting isoprene unit (C5),
isopentenyl pyrophosphate (IPP). Recently, in vivo feeding experiments
with 13C showed that a mevalonate-independent pathway for the formation
of IPP operates in various eubacteria, green algae and plant
chloroplasts. In this pathway, hydroxyethylthiamine, which is formed by
decarboxylation of pyruvate, and glyceraldehyde-3-phosphate (3-GAP) are
first converted into 1-deoxy-D-xylulose-5-phosphate in a
"transketolase" reaction mediated by 1-deoxy-D-xylulose-5-phosphate
synthase (Lange et al., 1998; Schwender et al., 1997; Arigoni et al.,
1997; Lichtenthaler et al., 1997; Sprenger et al., 1997). This is then
converted by an intramolecular rearrangement into
2-C-methyl-D-erythritol-4-phosphate and subsequently into IPP (Arigoni
et al., 1997; Zeidler et al., 1998). Biochemical data indicate that the
mevalonate pathway operates in the cytosol and leads to the formation
of phytosterols. The antibiotic mevinolin, a specific inhibitor of
mevalonate formation, inhibits only sterol biosynthesis in the
cytoplasm, whereas prenyl lipid formation in the plastids is unaffected
(Bach & Lichtenthaler, 1993). The mevalonate-independent pathway, by
contrast, is localized in the plastids and leads primarily to the
formation of carotenoids and plastidic prenyl lipids (Schwender et al.,
1997; Arigoni et al., 1997).

IPP is in equilibrium with its isomer, dimethylallyl pyrophosphate
(DMAPP). Head-to-tail condensation of IPP with DMAPP yields the
monoterpene (C10) geranyl pyrophosphate (GPP). The addition of further
IPP units leads to the sesquiterpene (C15) farnesyl pyrophosphate (FPP)
and to the diterpene (C20) geranylgeranyl pyrophosphate (GGPP). Linking
two GGPP molecules gives rise to the C40 precursors of the carotenoids.
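
The chain-length arithmetic of these condensation steps can be followed in a few lines of Python. This is a minimal sketch for illustration only; the names `carbons_after` and `PRENYL_PYROPHOSPHATES` are assumptions introduced here.

```python
ISOPRENE_CARBONS = 5  # IPP and DMAPP are both C5 units

# Chain length in carbon atoms -> abbreviation used in the text.
PRENYL_PYROPHOSPHATES = {10: "GPP", 15: "FPP", 20: "GGPP"}


def carbons_after(ipp_additions: int) -> int:
    """Carbon count after adding IPP units to one DMAPP starter unit."""
    if ipp_additions < 0:
        raise ValueError("number of IPP additions must not be negative")
    return ISOPRENE_CARBONS * (1 + ipp_additions)


for additions in (1, 2, 3):
    carbons = carbons_after(additions)
    print(additions, carbons, PRENYL_PYROPHOSPHATES[carbons])

# Two GGPP molecules joined together give the C40 carotenoid precursor.
print(2 * carbons_after(3))
```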

In mixed prenyl lipids, the isoprene side chain of varying length is
attached to non-isoprene rings, for example a porphyrin ring in
chlorophyll a and b. The chlorophylls and phylloquinones contain a C20
phytyl chain in which only the first isoprene unit contains a double
bond. GGPP is converted by geranylgeranyl pyrophosphate oxidoreductase
(GGPPOR) into phytyl pyrophosphate (PPP), the starting material for the
further formation of tocopherols.

The ring structures of the mixed prenyl lipids that lead to the
formation of vitamins E and K are quinones whose starting metabolites
derive from the shikimate pathway. The aromatic amino acids
phenylalanine or tyrosine are converted into hydroxyphenylpyruvate,
which is converted by dioxygenation into homogentisic acid. Chorismate
is formed starting from erythrose-4-phosphate and phosphoenolpyruvate
(PEP), by their condensation to
3-deoxy-D-arabino-heptulosonate-7-phosphate (DAHP), via the shikimate
pathway intermediates 3'-dehydroquinate, 3'-dehydroshikimate,
shikimate, shikimate-3-phosphate and
5'-enolpyruvylshikimate-3-phosphate. The erythrose-4-phosphate is
formed by the Calvin cycle and the PEP is supplied by glycolysis. The
homogentisic acid described above is then bound to phytyl pyrophosphate
(PPP) or geranylgeranyl pyrophosphate to form the precursors of
α-tocopherol and α-tocotrienol, namely 2-methyl-6-phytylhydroquinone
and 2-methyl-6-geranylgeranylhydroquinone, respectively. Methylation
steps with S-adenosylmethionine as methyl group donor first give
2,3-dimethyl-6-phytylquinol, then cyclization gives γ-tocopherol and a
further methylation gives α-tocopherol (Richter, Biochemie der
Pflanzen, Georg Thieme Verlag Stuttgart, 1996).
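
The final steps from homogentisic acid to α-tocopherol can be represented as an ordered list of conversions. The Python sketch below is an illustration under stated assumptions: the step labels and the names `PATHWAY` and `downstream` are introduced here for clarity and are not terminology from the description.

```python
# (substrate, step, product) in pathway order, phytyl branch only.
PATHWAY = [
    ("homogentisic acid + phytyl pyrophosphate",
     "attachment of the phytyl chain",
     "2-methyl-6-phytylhydroquinone"),
    ("2-methyl-6-phytylhydroquinone",
     "methylation (S-adenosylmethionine)",
     "2,3-dimethyl-6-phytylquinol"),
    ("2,3-dimethyl-6-phytylquinol",
     "cyclization",
     "gamma-tocopherol"),
    ("gamma-tocopherol",
     "methylation (S-adenosylmethionine)",
     "alpha-tocopherol"),
]


def downstream(compound: str) -> list[str]:
    """List every product formed after the given compound, in order."""
    products = []
    current = compound
    for substrate, _step, product in PATHWAY:
        if substrate == current:
            products.append(product)
            current = product
    return products


print(downstream("2-methyl-6-phytylhydroquinone"))
```

Listing the products downstream of 2-methyl-6-phytylhydroquinone shows which pools would be expected to be fed by a higher methyltransferase activity at that step.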

The literature contains examples showing that manipulating a single
enzyme can directionally influence metabolite flux. In experiments with
altered expression of phytoene synthase, which links two GGPP molecules
to form 15-cis-phytoene, a direct effect on the carotenoid levels of
these transgenic tomato plants was measured (Fray and Grierson, Plant
Mol. Biol. 22(4), 589-602 (1993); Fray et al., Plant J. 8, 693-701
(1995)). As expected, transgenic tobacco plants with reduced levels of
phenylalanine ammonia-lyase show reduced phenylpropanoid levels. The
enzyme phenylalanine ammonia-lyase catalyzes the degradation of
phenylalanine and thus withdraws it from phenylpropanoid biosynthesis
(Bate et al., Proc. Natl. Acad. Sci. USA 91(16): 7608-7612 (1994);
Howles et al., Plant Physiol. 112, 1617-1624 (1996)).

Little is known so far about increasing metabolite flux in order to
raise the tocopherol or tocotrienol content of plants by overexpressing
individual biosynthesis genes. Only WO 97/27285 describes a
modification of the tocopherol content through increased expression or
down-regulation of the enzyme p-hydroxyphenylpyruvate dioxygenase
(HPPD). WO 99/04622 describes a gene sequence coding for a γ-tocopherol
methyltransferase from a photosynthetically active organism.

WO 99/23231 shows that the expression of a geranylgeranyl reductase in
transgenic plants results in increased tocopherol biosynthesis.

35 Aufgabe der vorliegenden Erf indung war die Entwicklung einer 

transgenen Pf lanze mit erhShtem Gehalt an Tocopherolen und Toco- 
trienolen . 

Die Aufgabe wurden iiberraschenderweise gel6st durch die tiber- 
40 expression eines 2-Methyl-6-phytylhydrochinon-methyltransf erase 

Gens in Pflanzen. 

Zu diesem Zweck wurde in transgenen Pflanzen die Aktivitat der 
2-Methyl-6-phytylhydrochinon-methyltransferase ( MPMT ) durch 
45 Uberexpression des MPMT-Gens aus Synechocystis spec. PCC 6803 er- 
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hflht. Dies kann prinzipiell durch Expression homologer oder hete- 
rologer MPMT-Gene erreicht werden. 

Example 2 describes for the first time the cloning of an MPMT DNA sequence (SEQ ID NO: 1) from Synechocystis spec. PCC 6803. To ensure plastid localization, a transit signal sequence (Fig. 3, Fig. 4) is placed upstream of the MPMT nucleotide sequence from Synechocystis. Also suitable for an expression cassette is a DNA sequence that codes for an MPMT gene, hybridizes with SEQ ID NO: 1 or is homologous to the complete sequence or to partial sequences, and originates from other organisms or from plants.

The 2,3-dimethyl-6-phytylhydroquinone that is now available in increased amounts owing to the additional expression of the MPMT gene is converted further toward tocopherols and tocotrienols (Figure 1).

The transgenic plants are produced by transforming the plants with a construct containing the MPMT gene. Arabidopsis thaliana, Brassica napus and Nicotiana tabacum were used as model plants for the production of tocopherols and tocotrienols.

Measurements on Synechocystis MPMT knock-out mutants revealed a drastic decrease in the content of tocopherols and tocotrienols. This demonstrates the direct influence of the plastidic plant MPMT on the synthesis of tocopherols and tocotrienols.

The invention relates to the use of a DNA sequence SEQ ID NO: 1 from Synechocystis spec. PCC 6803, which codes for an MPMT or its functional equivalents, for producing a plant with an increased content of tocopherols and tocotrienols. The nucleic acid sequence can be, for example, a DNA or cDNA sequence. Coding sequences suitable for insertion into an expression cassette are, for example, those that code for an MPMT and confer on the host the ability to overproduce tocopherols and tocotrienols.

The expression cassettes additionally contain regulatory nucleic acid sequences that control expression of the coding sequence in the host cell. In a preferred embodiment, an expression cassette comprises a promoter upstream, i.e. at the 5' end of the coding sequence, and a polyadenylation signal downstream, i.e. at the 3' end, and optionally further regulatory elements that are operatively linked to the intervening coding sequence for the MPMT gene. An operative linkage means the sequential arrangement of promoter, coding sequence, terminator and, where appropriate, further regulatory elements such that each regulatory element can fulfil its intended function in the expression of the coding sequence. The sequences preferred for operative linkage, without being limited thereto, are targeting sequences that ensure subcellular localization in the apoplast, the vacuole, plastids, the mitochondrion, the endoplasmic reticulum (ER), the nucleus, oil bodies or other compartments, and translation enhancers such as the 5' leader sequence from tobacco mosaic virus (Gallie et al., Nucl. Acids Res. 15 (1987), 8693-8711).

By way of example, the plant expression cassette can be inserted into a derivative of the transformation vector pBin-19 with 35S promoter (Bevan, M., Nucleic Acids Research 12: 8711-8721 (1984)). Figure 4 shows a derivative of the transformation vector pBin-19 with the seed-specific legumin B4 promoter.

In principle, any promoter that can control the expression of foreign genes in plants is suitable as promoter of the expression cassette. Preferably, a plant promoter or a promoter derived from a plant virus is used. The CaMV 35S promoter from cauliflower mosaic virus (Franck et al., Cell 21 (1980), 285-294) is particularly preferred. This promoter is known to contain various recognition sequences for transcriptional effectors which, taken together, lead to permanent and constitutive expression of the introduced gene (Benfey et al., EMBO J. 8 (1989), 2195-2202).

The expression cassette can also contain a chemically inducible promoter by which expression of the exogenous MPMT gene in the plant can be controlled at a particular time. Promoters of this kind that can be used include, among others, the PRP1 promoter (Ward et al., Plant Mol. Biol. 22 (1993), 361-366), a salicylic acid-inducible promoter (WO 95/19443), a benzenesulfonamide-inducible promoter (EP-A 388186), a tetracycline-inducible promoter (Gatz et al., (1992) Plant J. 2, 397-404), an abscisic acid-inducible promoter (EP-A 335528), or an ethanol- or cyclohexanone-inducible promoter (WO 93/21334).

Also particularly preferred are promoters that ensure expression in tissues or plant parts in which, for example, the biosynthesis of tocopherol or its precursors takes place. Promoters that ensure leaf-specific expression deserve particular mention. Examples are the promoter of the cytosolic FBPase from potato and the ST-LSI promoter from potato (Stockhaus et al., EMBO J. 8 (1989), 2445-245).

With the aid of a seed-specific promoter, a foreign protein could be stably expressed in the seeds of transgenic tobacco plants up to a proportion of 0.67% of the total soluble seed protein (Fiedler and Conrad, Bio/Technology 10 (1995), 1090-1094). The expression cassette can therefore contain, for example, a seed-specific promoter (preferably the phaseolin promoter (US 5504200), the USP promoter (Bäumlein, H. et al., Mol. Gen. Genet. (1991) 225 (3), 459-467) or the LEB4 promoter (Fiedler and Conrad, 1995)), the LEB4 signal peptide, the gene to be expressed and an ER retention signal.

An expression cassette is produced by fusing a suitable promoter to a suitable MPMT DNA sequence and, preferably, to a DNA inserted between promoter and MPMT DNA sequence that codes for a chloroplast-specific transit peptide, and to a polyadenylation signal, using common recombination and cloning techniques as described, for example, in T. Maniatis, E.F. Fritsch and J. Sambrook, Molecular Cloning: A Laboratory Manual, Cold Spring Harbor Laboratory, Cold Spring Harbor, NY (1989), in T.J. Silhavy, M.L. Berman and L.W. Enquist, Experiments with Gene Fusions, Cold Spring Harbor Laboratory, Cold Spring Harbor, NY (1984), and in Ausubel, F.M. et al., Current Protocols in Molecular Biology, Greene Publishing Assoc. and Wiley-Interscience (1987).
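The 5'-to-3' arrangement of a cassette described here (promoter, optional transit peptide, MPMT coding sequence, polyadenylation signal) can be pictured as a small data model. The following Python sketch is purely illustrative: the names Element, ORDER and is_valid_cassette are invented for this example, and the element labels are plain strings taken from the text, not sequence data.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Element:
    kind: str  # one of the entries in ORDER
    name: str  # free-text label, e.g. "CaMV 35S"


# Assumed 5'->3' order; the transit peptide is optional (preferred in the text).
ORDER = ["promoter", "transit_peptide", "cds", "polyA"]
REQUIRED = {"promoter", "cds", "polyA"}


def is_valid_cassette(elements):
    """Check that elements appear at most once each and in 5'->3' order."""
    kinds = [e.kind for e in elements]
    if any(k not in ORDER for k in kinds):
        return False
    ranks = [ORDER.index(k) for k in kinds]
    # Strictly increasing ranks means correct order without duplicates.
    if ranks != sorted(set(ranks)):
        return False
    return REQUIRED <= set(kinds)


cassette = [
    Element("promoter", "CaMV 35S"),
    Element("transit_peptide", "tobacco transketolase transit peptide"),
    Element("cds", "MPMT (SEQ ID NO: 1)"),
    Element("polyA", "octopine synthase polyadenylation signal"),
]
print(is_valid_cassette(cassette))
```

The check only models the order of elements; it says nothing about reading frame or sequence compatibility at the junctions.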

Sequences that ensure targeting to the plastids are particularly preferred.

It is also possible to use expression cassettes whose DNA sequence codes for an MPMT fusion protein, part of the fusion protein being a transit peptide that directs translocation of the polypeptide. Preference is given to chloroplast-specific transit peptides that are cleaved enzymatically from the MPMT moiety after translocation of the MPMT protein into the chloroplasts. Particularly preferred is the transit peptide derived from the plastidic Nicotiana tabacum transketolase or from another transit peptide (e.g. the transit peptide of the small subunit of Rubisco or of ferredoxin NADP oxidoreductase) or a functional equivalent thereof.

Particularly preferred are DNA sequences of three cassettes of the plastid transit peptide of the plastidic transketolase from tobacco in three reading frames as KpnI/BamHI fragments with an ATG codon in the NcoI cleavage site:

pTP09

KpnI_GGTACCATGGCGTCTTCTTCTTCTCTCACTCTCTCTCAAGCTATCCTCTCTCGTTCTGTC

ATCCAATCCCAATATCACCACCTCCCGCCGCCGTACTCCTTCCTCCGCCGCCGCCGCCGCCGTCG
TAAGGTCACCGGCGATTCGTGCCTCAGCTGCAACCGAAACCATAGAGAAAACTGAGACTGCGGGATCC_BamHI

pTP10

KpnI_GGTACCATGGCGTCTTCTTCTTCTCTCACTCTCTCTCAAGCTATCCTCTCTCGTTCTGTC
CCTCGCCATGGCTCTGCCTCTTCTTCTCAACTTTCCCCTTCTTCTCTCACTTTTTCCGGCCTTAA
ATCCAATCCCAATATCACCACCTCCCGCCGCCGTACTCCTTCCTCCGCCGCCGCCGCCGCCGTCG
TAAGGTCACCGGCGATTCGTGCCTCAGCTGCAACCGAAACCATAGAGAAAACTGAGACTGCGCTGGATCC_BamHI

pTP11

KpnI_GGTACCATGGCGTCTTCTTCTTCTCTCACTCTCTCTCAAGCTATCCTCTCTCGTTCTGTC
CCTCGCCATGGCTCTGCCTCTTCTTCTCAACTTTCCCCTTCTTCTCTCACTTTTTCCGGCCTTAA
ATCCAATCCCAATATCACCACCTCCCGCCGCCGTACTCCTTCCTCCGCCGCCGCCGCCGCCGTCG
TAAGGTCACCGGCGATTCGTGCCTCAGCTGCAACCGAAACCATAGAGAAAACTGAGACTGCGGGGATCC_BamHI
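The three cassettes differ only in the few bases between the shared end of the transit peptide sequence and the BamHI site, which is what places the BamHI site in three different reading frames. A minimal Python sketch of that check follows; it uses only the 3' ends as printed for pTP09, pTP10 and pTP11, and the names SHARED, spacer_length and frame_offsets are invented for this illustration.

```python
# 3' ends of the three cassettes, from the shared "GAGACTGCG" up to and
# including the BamHI recognition site GGATCC.
SHARED = "GAGACTGCG"
BAMHI = "GGATCC"
THREE_PRIME_ENDS = {
    "pTP09": "GAGACTGCGGGATCC",
    "pTP10": "GAGACTGCGCTGGATCC",
    "pTP11": "GAGACTGCGGGGATCC",
}


def spacer_length(end):
    """Number of bases between the shared sequence and the BamHI site."""
    if not (end.startswith(SHARED) and end.endswith(BAMHI)):
        raise ValueError("unexpected 3' end: " + end)
    return len(end) - len(SHARED) - len(BAMHI)


def frame_offsets(ends):
    """Spacer length modulo 3 for each cassette (relative frame shift)."""
    return {name: spacer_length(seq) % 3 for name, seq in ends.items()}


offsets = frame_offsets(THREE_PRIME_ENDS)
print(offsets)
# Three distinct values mean the BamHI site is offered in all three frames.
print(len(set(offsets.values())) == 3)
```

The offsets are relative to one another; which cassette matches a given coding sequence depends on the frame of that sequence at its own BamHI junction.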

The inserted nucleotide sequence coding for an MPMT can be produced synthetically or obtained naturally, or can contain a mixture of synthetic and natural DNA constituents, or can consist of various heterologous MPMT gene segments from different organisms. In general, synthetic nucleotide sequences are produced with codons that are preferred by plants. These plant-preferred codons can be determined from the codons with the highest protein frequency that are expressed in most plant species of interest. When preparing an expression cassette, various DNA fragments can be manipulated to obtain a nucleotide sequence that expediently reads in the correct direction and is equipped with a correct reading frame. Adapters or linkers can be attached to the fragments to join the DNA fragments to one another.

Expediently, the promoter and terminator regions can be provided, in the direction of transcription, with a linker or polylinker containing one or more restriction sites for insertion of this sequence. As a rule, the linker has 1 to 10, usually 1 to 8, preferably 2 to 6 restriction sites. In general, the linker within the regulatory regions has a size of less than 100 bp, frequently less than 60 bp, but at least 5 bp. The promoter can be either native or homologous, or foreign or heterologous, to the host plant. The expression cassette contains, in the 5'-3' direction of transcription, the promoter, a DNA sequence coding for an MPMT gene, and a region for transcriptional termination. Different termination regions are freely interchangeable.

Furthermore, manipulations that provide suitable restriction cleavage sites or that remove superfluous DNA or restriction cleavage sites can be employed. Where insertions, deletions or substitutions such as transitions and transversions come into consideration, in vitro mutagenesis, "primer repair", restriction or ligation can be used. With suitable manipulations such as restriction, "chewing back" or filling in of overhangs for "blunt ends", complementary ends of the fragments can be provided for ligation.

Preferred polyadenylation signals are plant polyadenylation signals, preferably those that essentially correspond to T-DNA polyadenylation signals from Agrobacterium tumefaciens, in particular of gene 3 of the T-DNA (octopine synthase) of the Ti plasmid pTiACH5 (Gielen et al., EMBO J. 3 (1984), 835 ff.), or functional equivalents.

Preferably, the fused expression cassette coding for an MPMT gene is cloned into a vector, for example pBin19, that is suitable for transforming Agrobacterium tumefaciens. Agrobacteria transformed with such a vector can then be used in a known manner to transform plants, in particular crop plants such as tobacco plants, for example by bathing wounded leaves or leaf pieces in an agrobacterial solution and subsequently cultivating them in suitable media. The transformation of plants by agrobacteria is known, inter alia, from F.F. White, Vectors for Gene Transfer in Higher Plants; in Transgenic Plants, Vol. 1, Engineering and Utilization, edited by S.D. Kung and R. Wu, Academic Press, 1993, pp. 15-38. From the transformed cells of the wounded leaves or leaf pieces, transgenic plants can be regenerated in a known manner that contain a gene for the expression of an MPMT gene integrated into the expression cassette.

To transform a host plant with a DNA coding for an MPMT, an expression cassette is incorporated as an insertion into a recombinant vector whose vector DNA contains additional functional regulatory signals, for example sequences for replication or integration. Suitable vectors are described, inter alia, in "Methods in Plant Molecular Biology and Biotechnology" (CRC Press), chapters 6/7, pp. 71-119 (1993).

Using the recombination and cloning techniques cited above, the expression cassettes can be cloned into suitable vectors that allow their propagation, for example in E. coli. Suitable cloning vectors include pBR322, the pUC series, the M13mp series and pACYC184. Binary vectors that can replicate both in E. coli and in agrobacteria are particularly suitable.

The invention further relates to the use of an expression cassette containing a DNA sequence SEQ ID NO: 1, or a DNA sequence hybridizing with it, for transforming plants or plant cells, tissues or parts. The aim of the use is preferably to increase the tocopherol and tocotrienol content of the plant.

Depending on the choice of promoter, expression can take place specifically in the leaves, the seeds, the petals or other parts of the plant. Such transgenic plants, their propagation material, and their plant cells, tissues or parts are a further subject of the present invention.

The expression cassette can moreover also be used to transform bacteria, cyanobacteria, yeasts, filamentous fungi and algae with the aim of increasing the content of tocopherols and tocotrienols.

The transfer of foreign genes into the genome of a plant is referred to as transformation. The methods described for the transformation and regeneration of plants from plant tissues or plant cells are used for transient or stable transformation. Suitable methods are protoplast transformation by polyethylene glycol-induced DNA uptake, the biolistic method using the gene gun (the so-called particle bombardment method), electroporation, incubation of dry embryos in DNA-containing solution, microinjection



WO 01/04330 A1

(84) Designated states (regional): ARIPO patent (GH, GM, KE, LS, MW, MZ, SD, SL, SZ, TZ, UG, ZW), Eurasian patent (AM, AZ, BY, KG, KZ, MD, RU, TJ, TM), European patent (AT, BE, CH, CY, DE, DK, ES, FI, FR, GB, GR, IE, IT, LU, MC, NL, PT, SE), OAPI patent (BF, BJ, CF, CG, CI, CM, GA, GN, GW, ML, MR, NE, SN, TD, TG).

Published:

- With international search report.

- Before expiry of the time limit for amending the claims; publication will be repeated if amendments are received.

For an explanation of the two-letter codes and the other abbreviations, reference is made to the "Guidance Notes on Codes and Abbreviations" at the beginning of each regular issue of the PCT Gazette.

WO 01/04330 PCT/EP00/05862

and Agrobacterium-mediated gene transfer. The methods mentioned are described, for example, in B. Jenes et al., Techniques for Gene Transfer, in: Transgenic Plants, Vol. 1, Engineering and Utilization, edited by S.D. Kung and R. Wu, Academic Press (1993), 128-143, and in Potrykus, Annu. Rev. Plant Physiol. Plant Molec. Biol. 42 (1991), 205-225. Preferably, the construct to be expressed is cloned into a vector that is suitable for transforming Agrobacterium tumefaciens, for example pBin19 (Bevan et al., Nucl. Acids Res. 12 (1984), 8711).

Agrobacteria transformed with an expression cassette can likewise be used in a known manner to transform plants, in particular crop plants such as cereals, maize, oats, soybean, rice, cotton, sugar beet, canola, sunflower, flax, hemp, potato, tobacco, tomato, oilseed rape, alfalfa, lettuce and the various tree, nut and vine species, for example by bathing wounded leaves or leaf pieces in an agrobacterial solution and subsequently cultivating them in suitable media.

Functionally equivalent sequences that code for an MPMT gene are those sequences that still possess the desired functions despite a differing nucleotide sequence. Functional equivalents thus comprise naturally occurring variants of the sequences described herein as well as artificial nucleotide sequences, e.g. those obtained by chemical synthesis and adapted to the codon usage of a plant.

A functional equivalent is understood to mean, in particular, also natural or artificial mutations of an originally isolated sequence coding for an MPMT that continue to show the desired function. Mutations comprise substitutions, additions, deletions, exchanges or insertions of one or more nucleotide residues. Thus, for example, the present invention also encompasses those nucleotide sequences that are obtained by modifying the MPMT nucleotide sequence. The aim of such a modification can be, for example, to further delimit the coding sequence contained therein or, for example, to insert additional restriction enzyme cleavage sites.



Example 8 describes a deletion clone of the MPMT gene (see SEQ ID NO: 7).



Functional equivalents are also those variants whose function is attenuated or enhanced compared with the starting gene or gene fragment.

Artificial DNA sequences are also suitable, provided that, as described above, they confer the desired property, for example of increasing the tocopherol content in the plant by overexpression of an MPMT gene in crop plants. Such artificial DNA sequences can be determined, for example, by back-translating proteins that were constructed by molecular modelling and have MPMT activity, or by in vitro selection. Coding DNA sequences obtained by back-translating a polypeptide sequence according to the codon usage specific to the host plant are particularly suitable. A person skilled in the art who is familiar with plant genetic methods can readily determine the specific codon usage by computer analysis of other, known genes of the plant to be transformed.
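The two steps named here, deriving codon usage from known genes of the host and back-translating a polypeptide with the preferred codons, can be sketched in a few lines of Python. This is an illustration under stated assumptions, not the procedure used in the application: the function names are invented, the standard genetic code is assumed, and the single reference sequence is simply the first 36 bases after the KpnI site of the transit peptide cassette shown earlier, standing in for a real collection of host genes.

```python
from collections import Counter, defaultdict
from itertools import product

# Standard genetic code, codons enumerated in TCAG order (TTT, TTC, TTA, ...).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def translate(dna):
    """Translate a DNA string codon by codon (trailing bases are ignored)."""
    usable = len(dna) - len(dna) % 3
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3))


def preferred_codons(reference_cds):
    """Most frequent codon per amino acid in the reference coding sequences."""
    counts = defaultdict(Counter)
    for cds in reference_cds:
        for i in range(0, len(cds) - len(cds) % 3, 3):
            codon = cds[i:i + 3]
            counts[CODON_TABLE[codon]][codon] += 1
    prefs = {}
    for codon, aa in CODON_TABLE.items():
        prefs.setdefault(aa, codon)  # fallback for amino acids never observed
    for aa, codon_counts in counts.items():
        prefs[aa] = codon_counts.most_common(1)[0][0]
    return prefs


def back_translate(protein, prefs):
    """Back-translate a protein string using one preferred codon per residue."""
    return "".join(prefs[aa] for aa in protein)


# Stand-in reference: start of the transit peptide cassette (assumption).
reference = ["ATGGCGTCTTCTTCTTCTCTCACTCTCTCTCAAGCT"]
prefs = preferred_codons(reference)
dna = back_translate("MSLTQ", prefs)
print(dna, translate(dna))
```

A realistic codon table would be built from many host genes; with a single short reference, most amino acids fall back to an arbitrary codon.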

Further suitable equivalent nucleic acid sequences are sequences that code for fusion proteins in which an MPMT polypeptide or a functionally equivalent part thereof is a constituent of the fusion protein. The second part of the fusion protein can be, for example, a further polypeptide with enzymatic activity or an antigenic polypeptide sequence by means of which MPMT expression can be detected (e.g. myc tag or his tag). Preferably, however, it is a regulatory protein sequence such as a transit peptide that directs the MPMT protein into the plastids.

For the purposes of the present invention, increasing the content of tocopherols and tocotrienols means the artificially acquired ability of an increased biosynthetic capacity for these compounds through functional overexpression of an MPMT gene SEQ ID NO: 1 or SEQ ID NO: 7 in the plant, compared with the plant not modified by genetic engineering, for the duration of at least one plant generation.

Both the tocopherol content and the tocotrienol content can be increased. Preferably, the tocopherol content is increased. Under certain conditions, however, it is also possible to preferentially increase the tocotrienol content.

The site of tocopherol biosynthesis is, among others, the leaf tissue, so that leaf-specific expression of the MPMT gene is appropriate. It is obvious, however, that tocopherol biosynthesis need not be restricted to the leaf tissue but can also take place tissue-specifically in all other parts of the plant, especially in fatty seeds.

In addition, constitutive expression of the exogenous MPMT gene is advantageous. On the other hand, inducible expression may also appear desirable.

The effectiveness of expression of the transgenically expressed MPMT gene can be determined, for example, in vitro by shoot meristem propagation. In addition, an expression of the MPMT gene that is altered in type and level, and its effect on tocopherol biosynthetic capacity, can be tested on test plants in greenhouse experiments.

The invention additionally relates to transgenic plants transformed with an expression cassette containing the sequence SEQ ID NO: 1 or SEQ ID NO: 7, or a DNA sequence hybridizing with it or homologous to the complete sequence or to partial sequences, and to transgenic cells, tissues, parts and propagation material of such plants. Particular preference is given to transgenic crop plants such as barley, wheat, rye, maize, oats, soybean, rice, cotton, sugar beet, canola, sunflower, flax, hemp, potato, tobacco, tomato, oilseed rape, alfalfa, Tagetes, lettuce and the various tree, nut and vine species.

Plants within the meaning of the invention are monocotyledonous and dicotyledonous plants.

The invention further relates to photosynthetically active organisms transformed with an expression cassette containing the sequence SEQ ID NO: 1 or SEQ ID NO: 7, or a DNA sequence hybridizing with it or homologous to the complete sequence or to partial sequences. Photosynthetically active organisms are, besides plants, for example cyanobacteria, mosses and algae.

Since this biosynthetic pathway is an exclusively plastid-localized metabolic pathway, it offers optimal target enzymes for the development of inhibitors. Since, according to the current state of the art, no enzyme identical or similar to the Synechocystis MPMT is found in human and animal organisms, it can be assumed that inhibitors should act very specifically on plants.

As already mentioned, MPMT is a potential target for herbicides. To be able to find efficient inhibitors of MPMT, it is necessary to provide suitable test systems with which inhibitor-enzyme binding studies can be carried out. For this purpose, for example, the complete cDNA sequence of the MPMT from Synechocystis is cloned into an expression vector (pQE, Qiagen) and overexpressed in E. coli.

The MPMT protein expressed with the aid of the expression cassette according to the invention is particularly suitable for finding inhibitors specific for MPMT.

For this purpose, the MPMT can be used, for example, in an enzyme assay in which the activity of the MPMT is determined in the presence and absence of the active compound to be tested. Comparing the two activity determinations allows a qualitative and quantitative statement to be made about the inhibitory behavior of the active compound to be tested.
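One common way to turn the two activity measurements into a quantitative statement is percent inhibition relative to the uninhibited control. The short Python sketch below shows that arithmetic; the function name and the example activity values are hypothetical, and the text itself does not prescribe this particular measure.

```python
def percent_inhibition(activity_without, activity_with):
    """Percent loss of enzyme activity caused by the test compound.

    Both activities must be in the same units (any units); the control
    activity measured without the compound must be positive.
    """
    if activity_without <= 0:
        raise ValueError("control activity must be positive")
    return 100.0 * (1.0 - activity_with / activity_without)


# Hypothetical readings: 200 units without compound, 50 units with it.
print(percent_inhibition(200.0, 50.0))
```

A negative result would indicate that the compound increased the measured activity rather than inhibiting it.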

With the aid of the test system according to the invention, a large number of chemical compounds can be checked quickly and simply for herbicidal properties. The method makes it possible to select reproducibly, from a large number of substances, specifically those with high potency, in order subsequently to carry out further in-depth tests familiar to the person skilled in the art with these substances.

The invention further relates to herbicides that can be identified with the test system described above.

Overexpression of the gene sequence SEQ ID NO: 1 or SEQ ID NO: 7 coding for an MPMT in a plant achieves increased resistance to inhibitors of MPMT. The transgenic plants produced in this way are likewise a subject of the invention.

The MPMT protein produced using the DNA sequence SEQ ID NO: 1 or SEQ ID NO: 7 is also suitable for carrying out biotransformations to provide larger amounts of 2,3-dimethyl-6-phytylhydroquinone. Here, 2-methyl-6-phytylhydroquinone is converted to 2,3-dimethyl-6-phytylhydroquinone in the presence of the enzyme MPMT and the cosubstrate S-adenosyl-L-methionine. In principle, the biotransformation can be carried out with whole cells that express the enzyme MPMT, with cell extracts from these cells, or with purified or highly pure MPMT in the presence of S-adenosyl-L-methionine.

Further subjects of the invention are:

Method for transforming a plant, characterized in that expression cassettes containing a DNA sequence SEQ ID NO: 1 or SEQ ID NO: 7, or a DNA sequence hybridizing with it or homologous to the complete sequence or to partial sequences, are introduced into a plant cell or protoplasts of plants and these are regenerated into whole plants.

Use of the DNA sequence SEQ ID NO: 1 or SEQ ID NO: 7, or of a DNA sequence hybridizing with it, for producing plants with an increased content of tocopherols and tocotrienols by expression of an MPMT DNA sequence in plants.



The invention is illustrated by the following examples, but is not limited to them:



Sequence analysis of recombinant DNA

Recombinant DNA molecules were sequenced with a laser fluorescence DNA sequencer from Licor (distributed by MWG Biotech, Ebersbach) using the method of Sanger (Sanger et al., Proc. Natl. Acad. Sci. USA 74 (1977), 5463-5467).



Example 1

Identification of a 2-methyl-6-phytylhydroquinone methyltransferase from Synechocystis spec. PCC 6803.

The 2-methyl-6-phytylhydroquinone methyltransferase from Synechocystis spec. PCC 6803 was cloned and identified as follows:

Using a sequence motif conserved in S-adenosyl-L-methionine methyltransferases that is responsible for binding S-adenosyl-L-methionine (SAM) (C.P. Joshi and V.L. Chiang, PMB 37: 663-674, 1998), a genomic DNA database of Synechocystis spec. PCC 6803 was screened (Kaneko et al., DNA Res. 3:109-136, 1996). The hypothetical proteins identified in the screen that possessed the SAM-binding motif were compared with the primary sequences of the Synechocystis spec. PCC 6803 γ-tocopherol methyltransferase (designated slr0089) and the Arabidopsis thaliana γ-tocopherol methyltransferase (David Shintani and Dean DellaPenna, Science 282:2098-2100, 1998).
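A motif-based screen of this kind can be pictured as a pattern search over predicted protein sequences. The Python sketch below is only a toy: the regular expression is an illustrative placeholder in the style of glycine-rich SAM-binding motifs and is not the motif from the cited reference, and the two sequences and their identifiers are invented.

```python
import re

# Illustrative placeholder pattern; NOT the motif used in the cited work.
SAM_MOTIF = re.compile(r"[VIL][VL][DE][VI]G[GC]G[TP]G")


def screen(proteins):
    """Return {identifier: motif start index} for proteins with a match."""
    hits = {}
    for identifier, sequence in proteins.items():
        match = SAM_MOTIF.search(sequence)
        if match:
            hits[identifier] = match.start()
    return hits


# Invented toy database of predicted proteins (one-letter amino acid code).
toy_db = {
    "orf_a": "MSTKRVLDVGCGTGLLAQ",
    "orf_b": "MKKLLPTAWERTYHHNQ",
}
hits = screen(toy_db)
print(hits)
```

In practice, candidates found this way would still need the sequence comparison described in the text, since a short motif alone matches many unrelated methyltransferases.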

This identified a hypothetical protein (designated sll0418, SEQ ID NO: 2) whose amino acid sequence showed low identity with the γ-tocopherol methyltransferases from Synechocystis spec. PCC 6803 and Arabidopsis thaliana (36% and 28% identity, respectively).

Further examination of the primary sequence of the hypothetical protein sll0418 demonstrated the presence of a putative prokaryotic signal sequence within the first 20 amino acids (PSIGNAL, PC/GENE™, IntelliGenetics, Inc. ©1991). Such a sequence could also be identified in the Synechocystis spec. PCC 6803 γ-tocopherol methyltransferase (slr0089) (D. Shintani and D. DellaPenna, Science 282:2098-2100, 1998) and points to an identical localization of the two proteins.
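Identity figures such as the 36% and 28% quoted above are computed from a pairwise alignment. The Python sketch below shows one common convention, matches divided by the number of aligned columns without gaps; the text does not state which convention or alignment program was used, so this is an assumption, and the two aligned strings are invented.

```python
def percent_identity(aligned_a, aligned_b):
    """Percent identical residues over gap-free columns of an alignment.

    Both inputs are rows of the same pairwise alignment, with '-' for gaps.
    """
    if len(aligned_a) != len(aligned_b):
        raise ValueError("aligned sequences must have equal length")
    columns = [(x, y) for x, y in zip(aligned_a, aligned_b)
               if x != "-" and y != "-"]
    if not columns:
        return 0.0
    matches = sum(1 for x, y in columns if x == y)
    return 100.0 * matches / len(columns)


# Invented toy alignment: 5 gap-free columns, 4 of them identical.
print(percent_identity("MKT-LV", "MRTALV"))
```

Other conventions divide by the full alignment length or by the length of the shorter sequence, which gives lower values when the alignment contains gaps.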

The predicted molecular weight of the unprocessed protein is 34.9 kDa and is thus in a range that was also determined for the Synechocystis spec. PCC 6803 γ-tocopherol methyltransferase (David Shintani and Dean DellaPenna, Science 282:2098-2100, 1998) and for the γ-tocopherol methyltransferase purified from pepper fruits (d'Harlingue and Camara, Plastid enzymes of terpenoid biosynthesis: Purification of γ-Tocopherol Methyltransferase from Capsicum Chromoplasts, Journal of Biological Chemistry, Vol. 260, No. 28, 15200-15203, 1985).

Taking these facts into account, we concluded that the hypothetical protein sll0418 could be a tocopherol methyltransferase.

Example 2

Amplification and cloning of the 2-methyl-6-phytylhydroquinone methyltransferase from Synechocystis spec. PCC 6803

The DNA coding for the ORF (open reading frame) sll0418 was amplified from Synechocystis spec. PCC 6803 by polymerase chain reaction (PCR) according to the method of Crispin A. Howitt (BioTechniques 21:32-34, July 1996) using a sense-specific primer (sll04185', SEQ ID NO: 5) and an antisense-specific primer (sll04183', SEQ ID NO: 6).



The PCR conditions were as follows:

The PCR was carried out in a 50 µl reaction mixture containing:

- 5 µl of a Synechocystis spec. PCC 6803 cell suspension
- 0.2 mM dATP, dTTP, dGTP, dCTP
- 1.5 mM Mg(OAc)2
- 5 µg bovine serum albumin
- 40 pmol sll04185'
- 40 pmol sll04183'
- 15 µl 3.3x rTth DNA polymerase XL buffer (PE Applied Biosystems)
- 5 U rTth DNA polymerase XL (PE Applied Biosystems)

The PCR was carried out under the following cycling conditions:

Step 1: 5 minutes 94°C (denaturation)
Step 2: 3 seconds 94°C
Step 3: 2 minutes 58°C (annealing)
Step 4: 2 minutes 72°C (elongation)
40 repetitions of steps 2-4
Step 5: 10 minutes 72°C (post-elongation)
Step 6: 4°C (hold)
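The cycling program and the primer amounts lend themselves to a quick arithmetic check. The Python sketch below encodes the steps as listed and computes the total programmed hold time and the final primer concentration. It assumes that steps 2-4 are run 40 times in total and ignores temperature ramp times; the names Step, total_seconds and primer_micromolar are invented for this example.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Step:
    label: str
    seconds: int
    celsius: int


INITIAL = Step("denaturation", 5 * 60, 94)
CYCLE = [
    Step("denaturation", 3, 94),
    Step("annealing", 2 * 60, 58),
    Step("elongation", 2 * 60, 72),
]
FINAL = Step("post-elongation", 10 * 60, 72)
REPEATS = 40  # assumption: steps 2-4 are executed 40 times in total


def total_seconds():
    """Sum of programmed hold times, without ramping or the 4 degree hold."""
    per_cycle = sum(step.seconds for step in CYCLE)
    return INITIAL.seconds + REPEATS * per_cycle + FINAL.seconds


def primer_micromolar(pmol, volume_ul):
    """Final primer concentration; pmol per microliter equals micromolar."""
    return pmol / volume_ul


print(total_seconds() / 60)       # programmed minutes
print(primer_micromolar(40, 50))  # each primer in the 50 µl reaction
```

The same arithmetic shows that 15 µl of 3.3x buffer in 50 µl gives a final buffer strength of about 1x.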



The amplicon was cloned into the PCR cloning vector pGEM-T (Promega) using standard methods. The identity of the amplicon generated was confirmed by sequencing using the M13F (-40) primer.

Example 3

Generation of an sll0418 knock-out mutant

A DNA construct for generating a deletion mutant of the ORF sll0418 in Synechocystis spec. PCC 6803 was produced using standard cloning techniques.

The vector pGEM-T/sll0418 was digested using the restriction enzyme BalI. The presence of two BalI cleavage sites within the sll0418 sequence (positions bp 109 and bp 202) resulted in the loss of a 93 bp internal fragment. The aminoglycoside 3'-phosphotransferase of the transposon Tn903 was cloned into the BalI cleavage sites of the sll0418 ORF. For this purpose, Tn903 was isolated as an EcoRI fragment from the vector pUC4k (Vieira, J. and Messing, J., Gene 19: 259-268, 1982), the overhanging ends of the restriction digest were converted into blunt ends by standard methods, and the fragment was ligated into the BalI-cut vector pGEM-T/sll0418. The ligation mixture was used to transform E. coli XL1 blue cells. Transformants were selected using kanamycin and ampicillin. A recombinant plasmid (pGEM-T/sll0418::tn903) was isolated and used to transform Synechocystis spec. PCC 6803 by the method of Williams (Methods Enzymol. 167: 776-778, 1987).
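
The size of the deleted fragment follows directly from the two cleavage positions given above. A minimal sketch of that arithmetic, assuming the stated positions are cut coordinates of a blunt-cutting enzyme so that their difference equals the fragment length (the function name is hypothetical):

```python
def deleted_length(first_cut_bp: int, second_cut_bp: int) -> int:
    """Length of the internal fragment lost between two blunt cut positions."""
    if second_cut_bp <= first_cut_bp:
        raise ValueError("second cut must lie downstream of the first")
    return second_cut_bp - first_cut_bp


if __name__ == "__main__":
    # Positions bp 109 and bp 202 as stated in the text.
    print(deleted_length(109, 202), "bp")
```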

Synechocystis spec. PCC 6803 transformants were selected on kanamycin-containing (kan) BG-11 solid medium (Castenholz, Methods in Enzymology, pages 68-93, 1988) at 28°C and 30 µmol photons x (m² x s)⁻¹. Four independent knock-out mutants were generated after five rounds of selection (passages of single colonies onto fresh BG-11kan medium).

The complete loss of the endogenous sll0418 gene and its replacement by the recombinant sll0418::tn903 DNA was confirmed by PCR analyses.

Example 4

Comparison of tocopherol production in Synechocystis spec. PCC 6803 wild-type cells and the generated knock-out mutants of the ORF sll0418.

The cells of the four independent Synechocystis spec. PCC 6803 knock-out mutants of the ORF sll0418 cultured on BG-11kan agar medium, and untransformed wild-type cells, were used to inoculate liquid cultures. These cultures were grown at 28°C and 30 µmol photons x (m² x s)⁻¹ (30 µE) for approx. 3 days. After the OD730 of the individual cultures had been determined, the OD730 of all cultures was synchronized by appropriate dilution with BG-11 (wild types) or BG-11kan (mutants). These cultures, synchronized for cell density, were used to inoculate three cultures per mutant and three cultures of the wild-type controls. The biochemical analyses could thus be carried out using in each case three independently grown cultures of a mutant and of the corresponding wild types. The cultures were grown to an optical density of OD730 = 0.3. The medium of the cell culture was removed by centrifuging twice at 14000 rpm in an Eppendorf bench-top centrifuge. The cells were then disrupted by four incubations in an Eppendorf shaker at 30°C and 1000 rpm in 100% methanol for 15 minutes each, the supernatants obtained in each case being combined. Further incubation steps did not release any further tocopherols or tocotrienols.
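
The dilution used to synchronize the cultures to a common OD730 is not spelled out in the text. The following sketch shows one way the volumes could be computed, assuming that OD730 scales linearly with cell density in the range used. The function name and the example values are hypothetical and are not taken from the source.

```python
from typing import Tuple


def dilution_volumes(od_measured: float, od_target: float,
                     final_volume_ml: float) -> Tuple[float, float]:
    """Return (culture_ml, medium_ml) that give od_target in final_volume_ml.

    Assumes optical density is proportional to cell density (C1*V1 = C2*V2).
    """
    if not 0 < od_target <= od_measured:
        raise ValueError("target OD must be positive and not exceed the measured OD")
    culture_ml = final_volume_ml * od_target / od_measured
    return culture_ml, final_volume_ml - culture_ml


if __name__ == "__main__":
    # Hypothetical example: dilute a culture at OD730 0.8 to OD730 0.2 in 10 ml.
    culture, medium = dilution_volumes(0.8, 0.2, 10.0)
    print(f"{culture:.2f} ml culture + {medium:.2f} ml medium")
```

Mutant cultures would be diluted with BG-11kan and wild-type cultures with BG-11, as described above.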

To avoid oxidation, the extracts obtained were analyzed immediately after extraction using a Waters Alliance 2690 HPLC system. Tocopherols and tocotrienols were separated on a reversed-phase column (ProntoSil 200-3-C30, Bischoff) with a mobile phase of 100% methanol and identified by means of standards (Merck). Detection was based on the fluorescence of the substances (excitation 295 nm, emission 320 nm), which was measured using a Jasco FP 920 fluorescence detector.

No tocopherols or tocotrienols were found in the Synechocystis spec. PCC 6803 knock-out mutants of the ORF sll0418. Tocopherols and tocotrienols were, however, measured in the Synechocystis spec. PCC 6803 wild-type cells.

The loss of the ability to produce tocopherols and tocotrienols in the knock-out mutants of the ORF sll0418, compared with the Synechocystis spec. PCC 6803 wild-type cells, shows that the gene sll0418 encodes a 2-methyl-6-phytylhydroquinone methyltransferase.

Example 5

Functional characterization of the 2-methyl-6-phytylhydroquinone methyltransferase from Synechocystis spec. PCC 6803 by heterologous expression in E. coli.

The hypothetical protein sll0418 from Synechocystis spec. PCC 6803 was identified as a 2-methyl-6-phytylhydroquinone methyltransferase by functional expression in E. coli.

The gene sll0418 amplified from Synechocystis spec. PCC 6803 was subcloned in the correct reading frame into the expression vector pQE-30 (Qiagen). The primers sll04185' and sll04183' (SEQ ID No. 5 and 6) used to amplify the ORF sll0418 from Synechocystis spec. PCC 6803 were designed such that BamHI restriction sites were added to the 5' end and the 3' end of the amplicon, see SEQ ID No. 3. The sll0418 fragment was isolated from the recombinant plasmid pGEM-T/sll0418 using these flanking BamHI restriction sites and ligated into a BamHI-cut pQE-30 using standard methods. The ligation mixture was used to transform M15 E. coli cells, and kanamycin- and ampicillin-resistant transformants were analyzed. The kanamycin resistance is conferred by the pREP-4 plasmid contained in the M15 cells. A recombinant plasmid (pQE-30/sll0418) carrying the sll0418 fragment in the correct orientation was isolated. The identity and orientation of the insert were confirmed by sequencing.

The recombinant plasmid pQE-30/sll0418 was used to transform M15 E. coli cells in order to produce recombinant sll0418 protein. Using a colony obtained from the transformation, an overnight culture was inoculated in Luria broth medium with 200 µg/ml ampicillin (Amp) and 50 µg/ml kanamycin (Kan). Starting from this culture, a 100 ml Luria broth culture (Amp/Kan) was inoculated the next morning. This culture was incubated at 28°C on a shaker incubator until an OD600 of 0.35-0.4 was reached. Production of the recombinant protein was then induced by adding 0.4 mM isopropyl-β-D-thiogalactopyranoside (IPTG). The culture was shaken for a further 3 hours at 28°C and the cells were then pelleted by centrifugation at 8000 g.

The pellet was resuspended in 600 µl of lysis buffer (approx. 1-1.5 ml per g of pellet wet weight; 10 mM HEPES-KOH pH 7.8, 5 mM dithiothreitol (DTT), 0.24 M sorbitol). PMSF (phenylmethylsulfonyl fluoride) was then added to a final concentration of 0.15 mM and the mixture was placed on ice for 10 minutes. The cells were disrupted by a 10-second ultrasound pulse using an ultrasonic probe. After addition of Triton X-100 (final concentration 0.1%), the cell suspension was incubated on ice for 30 minutes. The mixture was then centrifuged for 30 minutes at 25000 x g and the supernatant was used in the assay.

The activity of the 2-methyl-6-phytylhydroquinone methyltransferase is determined by detecting the radiolabeled reaction product 2,3-dimethyl-6-phytylhydroquinone.

For this purpose, 135 µl of the enzyme (approx. 300-600 µg) were incubated together with 20 µl of substrate (2-methyl-6-phytylhydroquinone) and 15 µl (0.46 mM 14C-SAM) of methyl group donor in the following reaction buffer: 200 µl (125 mM) Tricine-NaOH pH 7.6, 100 µl (1.25 mM) sorbitol, 10 µl (50 mM) MgCl2 and 20 µl (250 mM) ascorbate, for 4 hours at 25°C in the dark.
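
The volumes listed above add up to a 500 µl assay. The sketch below computes the total volume and the resulting final concentrations. It rests on one explicit assumption that the text does not confirm: the concentrations in parentheses are read as stock concentrations of the added solutions. All names in the code are hypothetical.

```python
from typing import Dict, Optional, Tuple

# name -> (volume in µl, stock concentration in mM or None if not stated)
# ASSUMPTION: parenthesized concentrations in the text are stock concentrations.
COMPONENTS: Dict[str, Tuple[float, Optional[float]]] = {
    "enzyme extract": (135, None),
    "substrate": (20, None),
    "14C-SAM": (15, 0.46),
    "Tricine-NaOH pH 7.6": (200, 125),
    "sorbitol": (100, 1.25),
    "MgCl2": (10, 50),
    "ascorbate": (20, 250),
}


def total_volume_ul(components: Dict[str, Tuple[float, Optional[float]]]) -> float:
    """Total assay volume in µl."""
    return sum(volume for volume, _ in components.values())


def final_concentrations_mm(
    components: Dict[str, Tuple[float, Optional[float]]]
) -> Dict[str, float]:
    """Final concentration (mM) of every component with a stated stock."""
    total = total_volume_ul(components)
    return {
        name: volume * stock / total
        for name, (volume, stock) in components.items()
        if stock is not None
    }


if __name__ == "__main__":
    print(f"total volume: {total_volume_ul(COMPONENTS):.0f} µl")
    for name, conc in final_concentrations_mm(COMPONENTS).items():
        print(f"{name}: {conc:.4g} mM")
```

If the parenthesized values were instead meant as final concentrations, only the total volume from this sketch would still apply.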

The reaction was stopped by adding 750 µl of chloroform/methanol (1:2) + 150 µl of 0.9% NaCl. The mixed preparation was centrifuged briefly and the upper phase was discarded. The lower phase was transferred to a new reaction vessel and evaporated under nitrogen. The residues were taken up in 20 µl of ether and applied to a thin-layer plate for chromatographic separation of the substances (solid phase: HPTLC plates, silica gel 60 F254 (Merck); liquid phase: toluene). The radiolabeled reaction product was detected using a phosphorimager.

These experiments confirmed that the protein encoded by the gene sll0418 (SEQ ID No. 1) from Synechocystis spec. PCC 6803 is a 2-methyl-6-phytylhydroquinone methyltransferase, since it possesses the enzymatic activity for converting 2-methyl-6-phytylhydroquinone into 2,3-dimethyl-6-phytylhydroquinone.

Figure 2 shows a sequence comparison at the amino acid level between the γ-tocopherol methyltransferases from Synechocystis spec. PCC 6803 (slr0089) and A. thaliana (aratmt) and the 2-methyl-6-phytylhydroquinone methyltransferase (sll0418) from Synechocystis spec. PCC 6803. The identity with the γ-tocopherol methyltransferases from Synechocystis spec. PCC 6803 and Arabidopsis thaliana is 36% and 28%, respectively.
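
The text does not state how the identity percentages were calculated. One common convention, shown here only as an assumed illustration, divides the number of identical positions by the number of aligned columns. The function name and the toy sequences are hypothetical; they are not the alignment from Figure 2.

```python
def percent_identity(aligned_a: str, aligned_b: str) -> float:
    """Percent identical residues over all aligned columns ('-' marks a gap).

    ASSUMPTION: columns with a gap in one sequence count in the denominator;
    other conventions (e.g. dividing by the shorter sequence) give other values.
    """
    if len(aligned_a) != len(aligned_b):
        raise ValueError("aligned sequences must have equal length")
    columns = [(a, b) for a, b in zip(aligned_a, aligned_b)
               if not (a == "-" and b == "-")]
    if not columns:
        raise ValueError("alignment contains no residues")
    matches = sum(1 for a, b in columns if a == b and a != "-")
    return 100.0 * matches / len(columns)


if __name__ == "__main__":
    # Toy alignment for illustration only.
    print(f"{percent_identity('MPEY-LL', 'MAEYQLL'):.1f} % identity")
```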

Example 6

Substrate specificity of the 2-methyl-6-phytylhydroquinone methyltransferase

Enzymatic studies carried out as in Example 5 show that the enzyme MPMT, encoded by the gene sll0418 (SEQ ID No. 1) from Synechocystis spec. PCC 6803, converts 2-methyl-6-phytylhydroquinone into 2,3-dimethyl-6-phytylhydroquinone.

In addition, the enzyme MPMT has 2-methyl-6-geranylgeranylhydroquinone methyltransferase activity, whereas γ-tocopherol methyltransferase activity could not be detected. This demonstrates that the enzyme 2-methyl-6-phytylhydroquinone methyltransferase is involved in the biosynthesis of the tocotrienols, since it converts 2-methyl-6-geranylgeranylhydroquinone into 2,3-dimethyl-6-geranylgeranylhydroquinone. This clearly shows that the enzyme activity of the 2-methyl-6-phytylhydroquinone methyltransferase differs from that of the γ-tocopherol methyltransferase.

Example 7

Production of expression cassettes containing the MPMT gene

Transgenic plants were generated which express the 2-methyl-6-phytylhydroquinone methyltransferase from Synechocystis spec. PCC 6803 on the one hand under the control of the constitutive 35S promoter of CaMV (cauliflower mosaic virus) (Franck et al., Cell 21: 285-294, 1980) and on the other hand under the control of the seed-specific promoter of the legumin gene from Vicia faba (Kafatos et al., Nuc. Acid. Res., 14(6): 2707-2720, 1986). The plasmid generated for constitutive expression of the 2-methyl-6-phytylhydroquinone methyltransferase from Synechocystis spec. PCC 6803 was based on pBinAR-TkTp-9 (Ralf Badur, dissertation, University of Göttingen, 1998). This vector is a derivative of pBinAR (Höfgen and Willmitzer, Plant Sci. 66: 221-230, 1990) and contains the 35S promoter of CaMV (cauliflower mosaic virus) (Franck et al., 1980), the termination signal of the octopine synthase gene (Gielen et al., EMBO J. 3: 835-846, 1984) and the DNA sequence encoding the transit peptide of the plastid transketolase from Nicotiana tabacum (Ralf Badur, dissertation, University of Göttingen, 1998). Cloning the 2-methyl-6-phytylhydroquinone methyltransferase from Synechocystis spec. PCC 6803 into this vector in the correct reading frame generates a translational fusion of the 2-methyl-6-phytylhydroquinone methyltransferase with the plastid transit peptide. As a result, the transgene product is transported into the plastids.

To construct this plasmid, the gene sll0418 was isolated from the plasmid pGEM-T/sll0418 using the flanking BamHI restriction sites. This fragment was ligated into a BamHI-cut pBinAR-TkTp-9 using standard methods (see Figure 3). This plasmid (pBinAR-TkTp-9/sll0418) was used to generate transgenic Arabidopsis thaliana, Brassica napus and Nicotiana tabacum. Fragment A (529 bp) in Figure 3 contains the 35S promoter of CaMV (nucleotides 6909 to 7437 of cauliflower mosaic virus), fragment B (245 bp) encodes the transit peptide of the Nicotiana tabacum transketolase, fragment C (977 bp) encodes the ORF sll0418 from Synechocystis spec. PCC 6803, and fragment D (219 bp) encodes the termination signal of the octopine synthase gene.
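
The fragment sizes quoted for Figure 3 can be checked with simple arithmetic. The sketch below is an illustration with hypothetical names: it confirms that the nucleotide range given for the 35S promoter matches the stated 529 bp when counted inclusively, and it adds up the four stated fragment lengths. The sum ignores any linker or junction sequences, which the text does not specify.

```python
# Fragment lengths in bp as stated for Figure 3.
FRAGMENTS_BP = {
    "A: CaMV 35S promoter": 529,
    "B: transketolase transit peptide": 245,
    "C: ORF sll0418": 977,
    "D: octopine synthase terminator": 219,
}


def inclusive_length(start_nt: int, end_nt: int) -> int:
    """Number of nucleotides from start_nt to end_nt, both included."""
    return end_nt - start_nt + 1


def cassette_length_bp(fragments: dict) -> int:
    """Sum of the stated fragment lengths (junction sequences not included)."""
    return sum(fragments.values())


if __name__ == "__main__":
    print("35S promoter:", inclusive_length(6909, 7437), "bp")
    print("sum of fragments A-D:", cassette_length_bp(FRAGMENTS_BP), "bp")
```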

To generate a plasmid enabling seed-specific expression of the 2-methyl-6-phytylhydroquinone methyltransferase from Synechocystis spec. PCC 6803 in plants, the seed-specific promoter of the legumin B4 gene (Kafatos et al., Nuc. Acid. Res., 14(6): 2707-2720, 1986) was used. The 2.7 kb fragment of the legumin B4 gene promoter was isolated from the plasmid pCR-Script/lePOCS using the EcoRI site flanking the promoter on the 5' side and the KpnI site flanking it on the 3' side. The plasmid pBinAR-TkTp-9/sll0418 was likewise treated with the restriction enzymes EcoRI and KpnI. As a result, the 35S promoter of CaMV was excised from this plasmid. The promoter of the legumin gene was then cloned into this vector as an EcoRI/KpnI fragment, generating a plasmid that placed expression of the gene sll0418 under the control of this seed-specific promoter, see Figure 4. This plasmid (pBinARleP-TkTp-9/sll0418) was used to generate transgenic Arabidopsis thaliana, Brassica napus and Nicotiana tabacum plants.

Fragment A (2700 bp) in Figure 4 contains the promoter of the legumin B4 gene from Vicia faba, fragment B (245 bp) encodes the transit peptide of the Nicotiana tabacum transketolase, fragment C (977 bp) encodes the ORF sll0418 from Synechocystis spec. PCC 6803, and fragment D (219 bp) encodes the termination signal of the octopine synthase gene.

Example 8

Production of expression cassettes containing a deletion clone of the MPMT gene

On the basis of a computer analysis, a putative prokaryotic secretion signal was identified in the primary sequence of the ORF sll0418. To ensure that this signal does not adversely affect import of the protein into the plastids when expressed in plants, a derivative of the sll0418 sequence was generated in which the putative secretion signal was deleted (SEQ ID No. 7). This deletion was carried out using PCR technology. The primers used (sll0418DSp5', SEQ ID No. 9, and sll0418DSp3', SEQ ID No. 10) added an EcoRV restriction site to the 5' end of the sequence and a SalI restriction site to the 3' end, enabling directional cloning into the vector pBinAR-TkTp-9. The resulting plasmid pBinAR-TkTp-9/sll0418ΔSP is depicted in Figure 5. Fragment A (529 bp) in Figure 5 contains the 35S promoter of CaMV (nucleotides 6909 to 7437 of cauliflower mosaic virus), fragment B (245 bp) encodes the transit peptide of the Nicotiana tabacum transketolase, fragment C (930 bp) encodes the ORF sll0418ΔSP from Synechocystis spec. PCC 6803, and fragment D (219 bp) encodes the termination signal of the octopine synthase gene.

To generate a plasmid enabling seed-specific expression of the deletion clone of the 2-methyl-6-phytylhydroquinone methyltransferase from Synechocystis spec. PCC 6803 in plants, the seed-specific promoter of the legumin B4 gene described above (Kafatos et al., Nuc. Acid. Res., 14(6): 2707-2720, 1986) was likewise used. The 2.7 kb fragment of the legumin B4 gene promoter was isolated from the plasmid pCR-Script/lePOCS using the EcoRI site flanking the promoter on the 5' side and the KpnI site flanking it on the 3' side. The plasmid pBinAR-TkTp-9/sll0418ΔSP was likewise treated with the restriction enzymes EcoRI and KpnI. As a result, the 35S promoter of CaMV was excised from this plasmid. The promoter of the legumin gene was then cloned into this vector as an EcoRI/KpnI fragment, generating a plasmid that placed expression of the deletion clone of the gene sll0418 under the control of this seed-specific promoter, see Figure 6. Fragment A (2700 bp) in Figure 6 contains the promoter of the legumin B4 gene from Vicia faba, fragment B (245 bp) encodes the transit peptide of the Nicotiana tabacum transketolase, fragment C (930 bp) encodes the ORF sll0418ΔSP from Synechocystis spec. PCC 6803, and fragment D (219 bp) encodes the termination signal of the octopine synthase gene.

This plasmid (pBinARleP-TkTp-9/sll0418ΔSP) was used to generate transgenic Arabidopsis thaliana, Brassica napus and Nicotiana tabacum plants.

An increase in the tocopherol and tocotrienol content was also measured when the DNA sequence SEQ ID No. 7 was expressed in transgenic plants.

Example 9

Production of transgenic Arabidopsis thaliana plants

Wild-type Arabidopsis thaliana plants (Columbia) were transformed with the Agrobacterium tumefaciens strain EHA105 on the basis of a modified vacuum infiltration method (Steve Clough and Andrew Bent, Floral dip: a simplified method for Agrobacterium-mediated transformation of Arabidopsis thaliana. Plant J. 16(6): 735-43, 1998; Bechtold, N., Ellis, J. and Pelletier, G., In planta Agrobacterium-mediated gene transfer by infiltration of adult Arabidopsis thaliana plants. C R Acad Sci Paris, 1993, 1144(2): 204-212). The Agrobacterium tumefaciens cells used had previously been transformed with the plasmids pBinARleP-TkTp-9/sll0418 or pBinAR-TkTp-9/sll0418 (Figures 3 and 4).

Seeds of the primary transformants were selected on the basis of antibiotic resistance. Antibiotic-resistant seedlings were planted in soil and used as fully developed plants for biochemical analysis.

Example 10

Production of transgenic Brassica napus plants

The production of transgenic oilseed rape plants followed a protocol by Bade, J.B. and Damm, B. (in Gene Transfer to Plants, Potrykus, I. and Spangenberg, G., eds, Springer Lab Manual, Springer Verlag, 1995, 30-38), which also specifies the composition of the media and buffers used.

The transformations were carried out with the Agrobacterium tumefaciens strain EHA105. The plasmids pBinARleP-TkTp-9/sll0418 and pBinAR-TkTp-9/sll0418 were used for the transformation. Seeds of Brassica napus var. Westar were surface-sterilized with 70% ethanol (v/v), washed in water at 55°C for 10 minutes, incubated in 1% hypochlorite solution (25% v/v Teepol, 0.1% v/v Tween 20) for 20 minutes and washed six times with sterile water for 20 minutes each. The seeds were dried on filter paper for three days, and 10-15 seeds were germinated in a glass flask containing 15 ml of germination medium. The roots and apices were removed from several seedlings (approx. 10 cm in size) and the remaining hypocotyls were cut into pieces approx. 6 mm long. The approx. 600 explants obtained in this way were washed for 30 minutes with 50 ml of basal medium and transferred to a 300 ml flask. After addition of 100 ml of callus induction medium, the cultures were incubated for 24 hours at 100 rpm.

An overnight culture of the Agrobacterium strain was set up at 29°C in Luria broth medium with kanamycin (20 mg/l); 2 ml of this culture were incubated in 50 ml of Luria broth medium without kanamycin for 4 hours at 29°C to an OD600 of 0.4-0.5. After pelleting the culture at 2000 rpm for 25 min, the cell pellet was resuspended in 25 ml of basal medium. The concentration of bacteria in the solution was adjusted to an OD600 of 0.3 by adding further basal medium.

The callus induction medium was removed from the oilseed rape explants using sterile pipettes, 50 ml of Agrobacterium solution were added, and the mixture was mixed carefully and incubated for 20 min. The Agrobacterium suspension was removed, the oilseed rape explants were washed for 1 min with 50 ml of callus induction medium, and 100 ml of callus induction medium were then added. Co-cultivation was carried out for 24 h on a rotary shaker at 100 rpm. Co-cultivation was stopped by removing the callus induction medium, and the explants were washed twice for 1 min each with 25 ml and twice for 60 min each with 100 ml of wash medium at 100 rpm. The wash medium with the explants was transferred to 15 cm Petri dishes and the medium was removed using sterile pipettes.

For regeneration, 20-30 explants at a time were transferred to 90 mm Petri dishes containing 25 ml of shoot induction medium with kanamycin. The Petri dishes were sealed with 2 layers of Leukopor and incubated at 25°C and 2000 lux with photoperiods of 16 hours light/8 hours darkness. Every 12 days, the developing calli were transferred to fresh Petri dishes containing shoot induction medium. All further steps for regenerating whole plants were carried out as described by Bade, J.B. and Damm, B. (in: Gene Transfer to Plants, Potrykus, I. and Spangenberg, G., eds, Springer Lab Manual, Springer Verlag, 1995, 30-38).

Example 11

Production of transgenic Nicotiana tabacum plants

Ten ml of YEB medium with antibiotic (5 g/l beef extract, 1 g/l yeast extract, 5 g/l peptone, 5 g/l sucrose and 2 mM MgSO4) were inoculated with a colony of Agrobacterium tumefaciens and cultured overnight at 28°C. The cells were pelleted for 20 min at 4°C and 3500 rpm in a bench-top centrifuge and then resuspended in fresh YEB medium without antibiotics under sterile conditions. The cell suspension was used for the transformation.

The wild-type plants from sterile culture were obtained by vegetative propagation. For this purpose, only the tip of the plant was cut off and transferred to fresh 2MS medium in a sterile preserving jar. From the rest of the plant, the hairs on the upper side of the leaves and the midribs of the leaves were removed. The leaves were cut with a razor blade into pieces approx. 1 cm² in size. The Agrobacterium culture was transferred to a small Petri dish (diameter 2 cm). The leaf pieces were briefly drawn through this solution and placed with the underside of the leaf on 2MS medium in Petri dishes (diameter 9 cm) so that they touched the medium. After two days in the dark at 25°C, the explants were transferred to plates containing callus induction medium and kept at 28°C in a climate chamber. The medium had to be changed every 7-10 days. As soon as calli formed, the explants were transferred to sterile preserving jars containing shoot induction medium with Claforan (see above). After approximately one month, organogenesis occurred and the shoots formed could be cut off. The shoots were cultured on 2MS medium with Claforan and selection marker. As soon as a strong root ball had formed, the plants could be potted in pricking-out soil.



Example 12

Characterization of the transgenic plants

To confirm that expression of the 2-methyl-6-phytylhydroquinone methyltransferase from Synechocystis spec. PCC 6803 increases vitamin E biosynthesis in the transgenic plants, the tocopherol and tocotrienol contents in leaves and seeds of the plants transformed with the constructs pBinARleP-TkTp-9/sll0418 or pBinAR-TkTp-9/sll0418 (Arabidopsis thaliana, Brassica napus and Nicotiana tabacum) were analyzed. For this purpose, the transgenic plants were cultivated in the greenhouse, and plants expressing the gene encoding the 2-methyl-6-phytylhydroquinone methyltransferase from Synechocystis spec. PCC 6803 were analyzed at the Northern level. The tocopherol content and the tocotrienol content were determined in leaves and seeds of these plants. In all cases, the tocopherol or tocotrienol concentration in transgenic plants which additionally express a DNA sequence SEQ ID No. 1 or SEQ ID No. 7 was increased compared with untransformed plants.



Claims

1. A DNA sequence SEQ ID No. 1 or SEQ ID No. 7, and DNA sequences which hybridize therewith or are homologous to the entire sequence or to partial sequences, encoding a 2-methyl-6-phytylhydroquinone methyltransferase from Synechocystis.

2. Use of DNA sequences encoding a 2-methyl-6-phytylhydroquinone methyltransferase for producing plants and photosynthetically active organisms with an increased content of tocopherols and tocotrienols.

3. Use of a DNA sequence SEQ ID No. 1 or SEQ ID No. 7, or of a DNA sequence hybridizing therewith, encoding a 2-methyl-6-phytylhydroquinone methyltransferase, for producing plants and photosynthetically active organisms with an increased content of tocopherols and tocotrienols.

4. A method for producing plants and photosynthetically active organisms with an increased content of tocopherols and tocotrienols, characterized in that a DNA sequence SEQ ID No. 1 or SEQ ID No. 7, or a DNA sequence which hybridizes therewith or is homologous to the entire sequence or to partial sequences, is expressed in plants and photosynthetically active organisms.

5. A method for transforming a plant, characterized in that an expression cassette containing a promoter, a signal sequence, a DNA sequence SEQ ID No. 1 or SEQ ID No. 7 and a terminator, or a DNA sequence hybridizing therewith, is introduced into a plant cell, into callus tissue, a whole plant or protoplasts of plant cells.



6. A method for transforming plants according to claim 5, characterized in that the transformation is carried out using the strain Agrobacterium tumefaciens, electroporation or the particle bombardment method.

7. A plant with an increased content of tocopherols and tocotrienols containing an expression cassette according to claim 5.

8. A plant according to claim 7, selected from the group soybean, canola, barley, oats, wheat, oilseed rape, maize, rye, Tagetes or sunflower.

9. Use of the DNA sequence SEQ ID No. 1 or SEQ ID No. 7 or of a DNA sequence hybridizing therewith according to claim 1 for producing a test system for identifying inhibitors of the 2-methyl-6-phytylhydroquinone methyltransferase.



10. A test system based on the expression of the DNA sequence SEQ ID No. 1 or SEQ ID No. 7 or of a DNA sequence hybridizing therewith according to claim 1 for identifying inhibitors of the 2-methyl-6-phytylhydroquinone methyltransferase.

SEQUENCE LISTING

<110> SunGene GmbH & Co.KGaA

<120> Overexpression of a DNA sequence encoding a 2-methyl-phytylhydroquinone methyltransferase in plants.

<130> MPMTSynechocystis

<140>
<141>

<160> 10

<170> PatentIn Vers. 2.0

<210> 1
<211> 957
<212> DNA
<213> Synechocystis PCC6803

<220>
<221> CDS
<222> (1)..(957)

<400> 1

atg ccc gag tat ttg ctt ctg ccc gct ggc cta att tcc ctc tcc ctg  48
Met Pro Glu Tyr Leu Leu Leu Pro Ala Gly Leu Ile Ser Leu Ser Leu
1               5                   10                  15

gcg atc gcc gct gga ctg tat ctc cta act gcc cgg ggc tat cag tca  96
Ala Ile Ala Ala Gly Leu Tyr Leu Leu Thr Ala Arg Gly Tyr Gln Ser
            20                  25                  30

tcg gat tcc gtg gcc aac gcc tac gac caa tgg aca gag gac ggc att  144
Ser Asp Ser Val Ala Asn Ala Tyr Asp Gln Trp Thr Glu Asp Gly Ile
        35                  40                  45

ttg gaa tat tac tgg ggc gac cat atc cac ctc ggc cat tat ggc gat  192
Leu Glu Tyr Tyr Trp Gly Asp His Ile His Leu Gly His Tyr Gly Asp
    50                  55                  60

ccg cca gtg gcc aag gat ttc atc caa tcg aaa att gat ttt gtc cat  240
Pro Pro Val Ala Lys Asp Phe Ile Gln Ser Lys Ile Asp Phe Val His
65                  70                  75                  80

gcc atg gcc cag tgg ggc gga tta gat aca ctt ccc ccc ggc aca acg  288
Ala Met Ala Gln Trp Gly Gly Leu Asp Thr Leu Pro Pro Gly Thr Thr
                85                  90                  95

gta ttg gat gtg ggt tgc ggc att ggc ggt agc agt cgc att ctc gcc  336
Val Leu Asp Val Gly Cys Gly Ile Gly Gly Ser Ser Arg Ile Leu Ala
            100                 105                 110

aaa gat tat ggt ttt aac gtt acc ggc atc acc att agt ccc caa cag  384
Lys Asp Tyr Gly Phe Asn Val Thr Gly Ile Thr Ile Ser Pro Gln Gln
        115                 120                 125

gtg aaa cgg gcg acg gaa tta act cct ccc gat gtg acg gcc aag ttt  432
Val Lys Arg Ala Thr Glu Leu Thr Pro Pro Asp Val Thr Ala Lys Phe
    130                 135                 140

gcg gtg gac gat gct atg gct ttg tct ttt cct gac ggt agt ttc gac  480
Ala Val Asp Asp Ala Met Ala Leu Ser Phe Pro Asp Gly Ser Phe Asp
145                 150                 155                 160

gta gtt tgg tcg gtg gaa gca ggg ccc cac atg cct gac aaa gct gtg  528
Val Val Trp Ser Val Glu Ala Gly Pro His Met Pro Asp Lys Ala Val
                165                 170                 175

ttt gcc aag gaa tta ctg cgg gtc gtg aaa cca ggg ggc att ctg gtg  576
Phe Ala Lys Glu Leu Leu Arg Val Val Lys Pro Gly Gly Ile Leu Val
            180                 185                 190

gtg gcg gat tgg aat caa cgg gac gat cgc caa gtg ccc ctc aac ttc  624
Val Ala Asp Trp Asn Gln Arg Asp Asp Arg Gln Val Pro Leu Asn Phe
        195                 200                 205

tgg gaa aaa cca gtg atg cga caa ctg ttg gat caa tgg tcc cac cct  672
Trp Glu Lys Pro Val Met Arg Gln Leu Leu Asp Gln Trp Ser His Pro
    210                 215                 220

gcc ttt gcc agc att gaa ggt ttt gcg gaa aat ttg gaa gcc acg ggt  720
Ala Phe Ala Ser Ile Glu Gly Phe Ala Glu Asn Leu Glu Ala Thr Gly
225                 230                 235                 240

ttg gtg gag ggc cag gtg act act gct gat tgg act gta ccg acc ctc  768
Leu Val Glu Gly Gln Val Thr Thr Ala Asp Trp Thr Val Pro Thr Leu
                245                 250                 255

ccc gct tgg ttg gat acc att tgg cag ggc att atc cgg ccc cag ggc  816
Pro Ala Trp Leu Asp Thr Ile Trp Gln Gly Ile Ile Arg Pro Gln Gly
            260                 265                 270

tgg tta caa tac ggc att cgt ggg ttt atc aaa tcc gtg cgg gaa gta  864
Trp Leu Gln Tyr Gly Ile Arg Gly Phe Ile Lys Ser Val Arg Glu Val
        275                 280                 285

ccg act att tta ttg atg cgc ctt gcc ttt ggg gta gga ctt tgt cgc  912
Pro Thr Ile Leu Leu Met Arg Leu Ala Phe Gly Val Gly Leu Cys Arg
    290                 295                 300

ttc ggt atg ttc aaa gca gtg cga aaa aac gcc act caa gct taa      957
Phe Gly Met Phe Lys Ala Val Arg Lys Asn Ala Thr Gln Ala
305                 310                 315



<210> 2 
<211> 318 
<212> PRT 

<213> Synechocystis PCC6803 
<400> 2 

Met Pro Glu Tyr Leu Leu Leu Pro Ala Gly Leu Ile Ser Leu Ser Leu
1               5                   10                  15

Ala Ile Ala Ala Gly Leu Tyr Leu Leu Thr Ala Arg Gly Tyr Gln Ser
            20                  25                  30

Ser Asp Ser Val Ala Asn Ala Tyr Asp Gln Trp Thr Glu Asp Gly Ile
        35                  40                  45

Leu Glu Tyr Tyr Trp Gly Asp His Ile His Leu Gly His Tyr Gly Asp
    50                  55                  60

Pro Pro Val Ala Lys Asp Phe Ile Gln Ser Lys Ile Asp Phe Val His
65                  70                  75                  80

Ala Met Ala Gln Trp Gly Gly Leu Asp Thr Leu Pro Pro Gly Thr Thr
                85                  90                  95

Val Leu Asp Val Gly Cys Gly Ile Gly Gly Ser Ser Arg Ile Leu Ala
            100                 105                 110

Lys Asp Tyr Gly Phe Asn Val Thr Gly Ile Thr Ile Ser Pro Gln Gln
        115                 120                 125

Val Lys Arg Ala Thr Glu Leu Thr Pro Pro Asp Val Thr Ala Lys Phe
    130                 135                 140

Ala Val Asp Asp Ala Met Ala Leu Ser Phe Pro Asp Gly Ser Phe Asp
145                 150                 155                 160

Val Val Trp Ser Val Glu Ala Gly Pro His Met Pro Asp Lys Ala Val
                165                 170                 175

Phe Ala Lys Glu Leu Leu Arg Val Val Lys Pro Gly Gly Ile Leu Val
            180                 185                 190

Val Ala Asp Trp Asn Gln Arg Asp Asp Arg Gln Val Pro Leu Asn Phe
        195                 200                 205

Trp Glu Lys Pro Val Met Arg Gln Leu Leu Asp Gln Trp Ser His Pro
    210                 215                 220

Ala Phe Ala Ser Ile Glu Gly Phe Ala Glu Asn Leu Glu Ala Thr Gly
225                 230                 235                 240

Leu Val Glu Gly Gln Val Thr Thr Ala Asp Trp Thr Val Pro Thr Leu
                245                 250                 255

Pro Ala Trp Leu Asp Thr Ile Trp Gln Gly Ile Ile Arg Pro Gln Gly
            260                 265                 270

Trp Leu Gln Tyr Gly Ile Arg Gly Phe Ile Lys Ser Val Arg Glu Val
        275                 280                 285

Pro Thr Ile Leu Leu Met Arg Leu Ala Phe Gly Val Gly Leu Cys Arg
    290                 295                 300

Phe Gly Met Phe Lys Ala Val Arg Lys Asn Ala Thr Gln Ala
305                 310                 315



<210> 3
<211> 974
<212> DNA

<213> Synechocystis PCC6803

<220>
<221> CDS
<222> (7)..(963)

<400> 3

ggatcc atg ccc gag tat ttg ctt ctg ccc gct ggc cta att tcc ctc 48
Met Pro Glu Tyr Leu Leu Leu Pro Ala Gly Leu Ile Ser Leu
1 5 10

tcc ctg gcg atc gcc gct gga ctg tat ctc cta act gcc cgg ggc tat 96
Ser Leu Ala Ile Ala Ala Gly Leu Tyr Leu Leu Thr Ala Arg Gly Tyr
15 20 25 30

cag tca tcg gat tcc gtg gcc aac gcc tac gac caa tgg aca gag gac 144
Gln Ser Ser Asp Ser Val Ala Asn Ala Tyr Asp Gln Trp Thr Glu Asp
35 40 45

ggc att ttg gaa tat tac tgg ggc gac cat atc cac ctc ggc cat tat 192
Gly Ile Leu Glu Tyr Tyr Trp Gly Asp His Ile His Leu Gly His Tyr
50 55 60

ggc gat ccg cca gtg gcc aag gat ttc atc caa tcg aaa att gat ttt 240
Gly Asp Pro Pro Val Ala Lys Asp Phe Ile Gln Ser Lys Ile Asp Phe
65 70 75

gtc cat gcc atg gcc cag tgg ggc gga tta gat aca ctt ccc ccc ggc 288
Val His Ala Met Ala Gln Trp Gly Gly Leu Asp Thr Leu Pro Pro Gly
80 85 90

aca acg gta ttg gat gtg ggt tgc ggc att ggc ggt agc agt cgc att 336
Thr Thr Val Leu Asp Val Gly Cys Gly Ile Gly Gly Ser Ser Arg Ile
95 100 105 110

ctc gcc aaa gat tat ggt ttt aac gtt acc ggc atc acc att agt ccc 384
Leu Ala Lys Asp Tyr Gly Phe Asn Val Thr Gly Ile Thr Ile Ser Pro
115 120 125

caa cag gtg aaa cgg gcg acg gaa tta act cct ccc gat gtg acg gcc 432
Gln Gln Val Lys Arg Ala Thr Glu Leu Thr Pro Pro Asp Val Thr Ala
130 135 140

aag ttt gcg gtg gac gat gct atg gct ttg tct ttt cct gac ggt agt 480
Lys Phe Ala Val Asp Asp Ala Met Ala Leu Ser Phe Pro Asp Gly Ser
145 150 155

ttc gac gta gtt tgg tcg gtg gaa gca ggg ccc cac atg cct gac aaa 528
Phe Asp Val Val Trp Ser Val Glu Ala Gly Pro His Met Pro Asp Lys
160 165 170

gct gtg ttt gcc aag gaa tta ctg cgg gtc gtg aaa cca ggg ggc att 576
Ala Val Phe Ala Lys Glu Leu Leu Arg Val Val Lys Pro Gly Gly Ile
175 180 185 190

ctg gtg gtg gcg gat tgg aat caa cgg gac gat cgc caa gtg ccc ctc 624
Leu Val Val Ala Asp Trp Asn Gln Arg Asp Asp Arg Gln Val Pro Leu
195 200 205

aac ttc tgg gaa aaa cca gtg atg cga caa ctg ttg gat caa tgg tcc 672
Asn Phe Trp Glu Lys Pro Val Met Arg Gln Leu Leu Asp Gln Trp Ser
210 215 220

cac cct gcc ttt gcc agc att gaa ggt ttt gcg gaa aat ttg gaa gcc 720
His Pro Ala Phe Ala Ser Ile Glu Gly Phe Ala Glu Asn Leu Glu Ala
225 230 235

acg ggt ttg gtg gag ggc cag gtg act act gct gat tgg act gta ccg 768
Thr Gly Leu Val Glu Gly Gln Val Thr Thr Ala Asp Trp Thr Val Pro
240 245 250

acc ctc ccc gct tgg ttg gat acc att tgg cag ggc att atc cgg ccc 816
Thr Leu Pro Ala Trp Leu Asp Thr Ile Trp Gln Gly Ile Ile Arg Pro
255 260 265 270

cag ggc tgg tta caa tac ggc att cgt ggg ttt atc aaa tcc gtg cgg 864
Gln Gly Trp Leu Gln Tyr Gly Ile Arg Gly Phe Ile Lys Ser Val Arg
275 280 285

gaa gta ccg act att tta ttg atg cgc ctt gcc ttt ggg gta gga ctt 912
Glu Val Pro Thr Ile Leu Leu Met Arg Leu Ala Phe Gly Val Gly Leu
290 295 300

tgt cgc ttc ggt atg ttc aaa gca gtg cga aaa aac gcc act caa gct 960
Cys Arg Phe Gly Met Phe Lys Ala Val Arg Lys Asn Ala Thr Gln Ala
305 310 315

taa attgcggatc c 974
<210> 4 
<211> 318 
<212> PRT 

<213> Synechocystis PCC6803 
<400> 4 

Met Pro Glu Tyr Leu Leu Leu Pro Ala Gly Leu Ile Ser Leu Ser Leu
1 5 10 15

Ala Ile Ala Ala Gly Leu Tyr Leu Leu Thr Ala Arg Gly Tyr Gln Ser
20 25 30

Ser Asp Ser Val Ala Asn Ala Tyr Asp Gln Trp Thr Glu Asp Gly Ile
35 40 45

Leu Glu Tyr Tyr Trp Gly Asp His Ile His Leu Gly His Tyr Gly Asp
50 55 60

Pro Pro Val Ala Lys Asp Phe Ile Gln Ser Lys Ile Asp Phe Val His
65 70 75 80

Ala Met Ala Gln Trp Gly Gly Leu Asp Thr Leu Pro Pro Gly Thr Thr
85 90 95

Val Leu Asp Val Gly Cys Gly Ile Gly Gly Ser Ser Arg Ile Leu Ala
100 105 110

Lys Asp Tyr Gly Phe Asn Val Thr Gly Ile Thr Ile Ser Pro Gln Gln
115 120 125

Val Lys Arg Ala Thr Glu Leu Thr Pro Pro Asp Val Thr Ala Lys Phe
130 135 140

Ala Val Asp Asp Ala Met Ala Leu Ser Phe Pro Asp Gly Ser Phe Asp
145 150 155 160

Val Val Trp Ser Val Glu Ala Gly Pro His Met Pro Asp Lys Ala Val
165 170 175

Phe Ala Lys Glu Leu Leu Arg Val Val Lys Pro Gly Gly Ile Leu Val
180 185 190

Val Ala Asp Trp Asn Gln Arg Asp Asp Arg Gln Val Pro Leu Asn Phe
195 200 205

Trp Glu Lys Pro Val Met Arg Gln Leu Leu Asp Gln Trp Ser His Pro
210 215 220

Ala Phe Ala Ser Ile Glu Gly Phe Ala Glu Asn Leu Glu Ala Thr Gly
225 230 235 240

Leu Val Glu Gly Gln Val Thr Thr Ala Asp Trp Thr Val Pro Thr Leu
245 250 255

Pro Ala Trp Leu Asp Thr Ile Trp Gln Gly Ile Ile Arg Pro Gln Gly
260 265 270

Trp Leu Gln Tyr Gly Ile Arg Gly Phe Ile Lys Ser Val Arg Glu Val
275 280 285

Pro Thr Ile Leu Leu Met Arg Leu Ala Phe Gly Val Gly Leu Cys Arg
290 295 300

Phe Gly Met Phe Lys Ala Val Arg Lys Asn Ala Thr Gln Ala
305 310 315



<210> 5 
<211> 27 
<212> DNA 

<213> Synechocystis PCC6803 
<220> 

<221> primer_bind

<400> 5 

ggatccatgc ccgagtattt gcttctg 27 



<210> 6 
<211> 26 
<212> DNA 

<213> Synechocystis PCC6803 
<220> 

<221> primer_bind
<222> (1)..(26)

<400> 6 

ggatccgcaa tttaagcttg agtggc 26 



<210> 7 
<211> 930 
<212> DNA 

<213> Synechocystis PCC6803 

<220> 

<221> CDS 

<222> (10)..(915)

<400> 7 

gatatcacc atg gcc gct gga ctg tat ctc cta act gcc cgg ggc tat cag 51
Met Ala Ala Gly Leu Tyr Leu Leu Thr Ala Arg Gly Tyr Gln
1 5 10

tca tcg gat tcc gtg gcc aac gcc tac gac caa tgg aca gag gac ggc 99
Ser Ser Asp Ser Val Ala Asn Ala Tyr Asp Gln Trp Thr Glu Asp Gly
15 20 25 30

att ttg gaa tat tac tgg ggc gac cat atc cac ctc ggc cat tat ggc 147
Ile Leu Glu Tyr Tyr Trp Gly Asp His Ile His Leu Gly His Tyr Gly
35 40 45

gat ccg cca gtg gcc aag gat ttc atc caa tcg aaa att gat ttt gtc 195
Asp Pro Pro Val Ala Lys Asp Phe Ile Gln Ser Lys Ile Asp Phe Val
50 55 60

cat gcc atg gcc cag tgg ggc gga tta gat aca ctt ccc ccc ggc aca 243
His Ala Met Ala Gln Trp Gly Gly Leu Asp Thr Leu Pro Pro Gly Thr
65 70 75

acg gta ttg gat gtg ggt tgc ggc att ggc ggt agc agt cgc att ctc 291
Thr Val Leu Asp Val Gly Cys Gly Ile Gly Gly Ser Ser Arg Ile Leu
80 85 90

gcc aaa gat tat ggt ttt aac gtt acc ggc atc acc att agt ccc caa 339
Ala Lys Asp Tyr Gly Phe Asn Val Thr Gly Ile Thr Ile Ser Pro Gln
95 100 105 110

cag gtg aaa cgg gcg acg gaa tta act cct ccc gat gtg acg gcc aag 387
Gln Val Lys Arg Ala Thr Glu Leu Thr Pro Pro Asp Val Thr Ala Lys
115 120 125

ttt gcg gtg gac gat gct atg gct ttg tct ttt cct gac ggt agt ttc 435
Phe Ala Val Asp Asp Ala Met Ala Leu Ser Phe Pro Asp Gly Ser Phe
130 135 140

gac gta gtt tgg tcg gtg gaa gca ggg ccc cac atg cct gac aaa gct 483
Asp Val Val Trp Ser Val Glu Ala Gly Pro His Met Pro Asp Lys Ala
145 150 155

gtg ttt gcc aag gaa tta ctg cgg gtc gtg aaa cca ggg ggc att ctg 531
Val Phe Ala Lys Glu Leu Leu Arg Val Val Lys Pro Gly Gly Ile Leu
160 165 170

gtg gtg gcg gat tgg aat caa cgg gac gat cgc caa gtg ccc ctc aac 579
Val Val Ala Asp Trp Asn Gln Arg Asp Asp Arg Gln Val Pro Leu Asn
175 180 185 190

ttc tgg gaa aaa cca gtg atg cga caa ctg ttg gat caa tgg tcc cac 627
Phe Trp Glu Lys Pro Val Met Arg Gln Leu Leu Asp Gln Trp Ser His
195 200 205

cct gcc ttt gcc agc att gaa ggt ttt gcg gaa aat ttg gaa gcc acg 675
Pro Ala Phe Ala Ser Ile Glu Gly Phe Ala Glu Asn Leu Glu Ala Thr
210 215 220

ggt ttg gtg gag ggc cag gtg act act gct gat tgg act gta ccg acc 723
Gly Leu Val Glu Gly Gln Val Thr Thr Ala Asp Trp Thr Val Pro Thr
225 230 235

ctc ccc gct tgg ttg gat acc att tgg cag ggc att atc cgg ccc cag 771
Leu Pro Ala Trp Leu Asp Thr Ile Trp Gln Gly Ile Ile Arg Pro Gln
240 245 250

ggc tgg tta caa tac ggc att cgt ggg ttt atc aaa tcc gtg cgg gaa 819
Gly Trp Leu Gln Tyr Gly Ile Arg Gly Phe Ile Lys Ser Val Arg Glu
255 260 265 270

gta ccg act att tta ttg atg cgc ctt gcc ttt ggg gta gga ctt tgt 867
Val Pro Thr Ile Leu Leu Met Arg Leu Ala Phe Gly Val Gly Leu Cys
275 280 285

cgc ttc ggt atg ttc aaa gca gtg cga aaa aac gcc act caa gct taa 915
Arg Phe Gly Met Phe Lys Ala Val Arg Lys Asn Ala Thr Gln Ala
290 295 300

attcttaagg tcgac 930



<210> 8 
<211> 301 
<212> PRT 

<213> Synechocystis PCC6803 
<400> 8 

Met Ala Ala Gly Leu Tyr Leu Leu Thr Ala Arg Gly Tyr Gln Ser Ser
1 5 10 15

Asp Ser Val Ala Asn Ala Tyr Asp Gln Trp Thr Glu Asp Gly Ile Leu
20 25 30

Glu Tyr Tyr Trp Gly Asp His Ile His Leu Gly His Tyr Gly Asp Pro
35 40 45

Pro Val Ala Lys Asp Phe Ile Gln Ser Lys Ile Asp Phe Val His Ala
50 55 60

Met Ala Gln Trp Gly Gly Leu Asp Thr Leu Pro Pro Gly Thr Thr Val
65 70 75 80

Leu Asp Val Gly Cys Gly Ile Gly Gly Ser Ser Arg Ile Leu Ala Lys
85 90 95

Asp Tyr Gly Phe Asn Val Thr Gly Ile Thr Ile Ser Pro Gln Gln Val
100 105 110

Lys Arg Ala Thr Glu Leu Thr Pro Pro Asp Val Thr Ala Lys Phe Ala
115 120 125

Val Asp Asp Ala Met Ala Leu Ser Phe Pro Asp Gly Ser Phe Asp Val
130 135 140

Val Trp Ser Val Glu Ala Gly Pro His Met Pro Asp Lys Ala Val Phe
145 150 155 160

Ala Lys Glu Leu Leu Arg Val Val Lys Pro Gly Gly Ile Leu Val Val
165 170 175

Ala Asp Trp Asn Gln Arg Asp Asp Arg Gln Val Pro Leu Asn Phe Trp
180 185 190

Glu Lys Pro Val Met Arg Gln Leu Leu Asp Gln Trp Ser His Pro Ala
195 200 205

Phe Ala Ser Ile Glu Gly Phe Ala Glu Asn Leu Glu Ala Thr Gly Leu
210 215 220

Val Glu Gly Gln Val Thr Thr Ala Asp Trp Thr Val Pro Thr Leu Pro
225 230 235 240

Ala Trp Leu Asp Thr Ile Trp Gln Gly Ile Ile Arg Pro Gln Gly Trp
245 250 255

Leu Gln Tyr Gly Ile Arg Gly Phe Ile Lys Ser Val Arg Glu Val Pro
260 265 270

Thr Ile Leu Leu Met Arg Leu Ala Phe Gly Val Gly Leu Cys Arg Phe
275 280 285

Gly Met Phe Lys Ala Val Arg Lys Asn Ala Thr Gln Ala
290 295 300



<210> 9 
<211> 31 
<212> DNA 

<213> Synechocystis PCC6803 
<220> 

<221> primer_bind 
<222> (1)..(31)

<400> 9

gatatcacca tggccgctgg actgtatctc c 31



<210> 10 
<211> 31 
<212> DNA 

<213> Synechocystis PCC6803 
<220> 

<221> primer_bind
<222> (1)..(31)

<400> 10

gtcgacctta agaatttaag cttgagtggc g 31
FORWARD PRIMER: 02 
CATTTGCTTC TGACACAACT GTGTTCACTA GCAACCTCAA ACAGACACCA 

12mer PRIMER 

TGGTGCACCT GACTCCTGTG GAGAAGTCTG CCGTTACTGC CCTGTGGGGC 
AAGGTGAACG TGGATGAAGT TGGTGGTGAG GCCCTGGGCA GGTTGGTATC 
AAGGTTACAA GACAGGTTTA AGGAGACCAA TAGAAACTGG GCATGTGGAG 

ACAGAGAAG 
REVERSE PRIMER 011 



FIG. 51 




